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0.  INTRODUCTION 


Some  notations:  Before  we  start,  let  us  quickly  list  some  of  the  notations 
and  conventions  that  will  be  used. 

When  an  expression  of  the  form  a^  is  a  subscript  or  exponent,  it  will  some¬ 
times  be  written  as  a(b)  for  typographical  convenience. 

The  characteristic  function  (indicator  function)  of  an  event  A  in  a  pro¬ 
bability  space  will  be  denoted  by  1^  or  1(A). 

The  complement  of  an  event  A  in  a  given  probability  space  will  be  denoted 
by  Ac. 

The  fraction  0/0  is  always  interpreted  to  be  0. 

For  any  number  p,  1  £  p  s  <»,  the  conjugate  exponent  will  be  denoted  by 
p'  (1  <  p'  5«) ;  that  is,  1/p  +  1/p •  =  1 . 

For  a  given  bounded  signed  measure  p,  the  total  variation  of  p  (on  the 
whole  measurable  space)  will  be  denoted  by  var(p) . 

The  zero  element  of  a  given  Banach  space  will  be  denoted  simply  by  0. 

For  n  =  1 ,2,3, . . . , 

[l,~]n:  =  ((Pj,...,  pn):  1  <  pk  <“  for  all  k-1,...,  n). 

Let  (fl,M,P)  be  a  fixed  probability  space.  By  a  "measure  of  dependence" 
we  mean  any  function  d  mapping  pairs  of  sub-o- fields  of  M  into 
1R+:  =  R+ u  {0}  u  {<»}  and  satisfying  the  following  two  natural  requirements: 

(0.1)  For  any  two  o-fields  F  and  GcM,  d(F,G)  =  sup  d(Fg,Gg)  where 
the  sup  is  taken  over  all  pairs  of  finite  o-fields  Fg  c  F  and  GQcG;  and 

(0.2)  d(F,G)  =0  if  and  only  if  F  and  G  are  independent  o-fields. 

As  a  consequence  of  eqn.  (0.1),  d  is  "increasing":  If  F^  c  F  and 
Gx  c G  then  dfF^Gj)  s  d(F,G) . 
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This  paper  is  mainly  concerned  with  "dominations"  of  measures  of  depen¬ 
dence  in  the  following  sense: 

We  say  that  a  measure  of  dependence  d^  is  "dominated"  by  another  measure 

of  dependence  d^  (and  we  write  dj^  d2)  if  there  exists  a  function  t:  if+-*-]R+ 

such  that  $>(0)  =0,  $  is  continuous  at  0,  and  d^  s  <I>(d2) 

(i.e.  dj (F,G)  s  4>(d2 (F,G) )  for  all  pairs  of  a-fields  F  and  G  c  M)  . 

If  dj-^d^  and  d2^d^,  then  we  say  that  d^  and  d2  are  "equivalent". 

Note  that  if  (Fn>Gn),  n  =  l,2, _  is  a  sequence  of  pairs  of  a-fields 

such  that  d-(F  ,G  )-*•()  as  n-*°°,  then  for  any  d,-4d_  we  have  that 
2  n  n  12 

d^  (Fn,Gn) -►0.  Conversely,  if  dj  (Fn  .G^) 0  as  n-*00  for  every  sequence  of 
pairs  of  a-fields  such  that  d^F^.G^)  -*0,  then  d^d2.  (To  see  why,  con¬ 
sider  the  function  4>(t) :  =  sup{dj(F,G):  d2(F,G)  <t}.) 

Stronger  types  of  "domination"  may  be  defined  by  imposing  conditions  on 
<t>.  For  example,  <J>  linear  implies  the  strong  condition  dj<Cd2.  In  all  cases 

of  domination  studied  in  [3J  and  in  this  paper,  4(t)  may  be  taken  as  a  power 
6 

t  ,  with  6>0,  for  small  t.  This  implies  that  if  d^F^.G^)  -+-0  exponentially 
fast  then  so  does  dj(Fn,G  ).  This  type  of  property  is  of  interest  in  connec¬ 
tion  with  certain  mixing  conditions,  but  we  shall  not  pursue  this  further  in 
the  present  paper. 

Here  and  throughout  the  paper,  for  measures  of  dependence  d^  and  d2,  the 
equation  dl=d2  means  that  dj(F,G)  =  d2(F»G)  for  a11  Pa*rs  o-fields  F 
and  G,  and  the  equation  d^sd2  means  that  d^(F,G)  <  d2(F,G)  for  all  F,G. 

Thus  dj  <  d2  is  technically  a  stronger  statement  than  d^d^ 

Here  are  some  examples  of  known  measures  of  dependence  that  we  shall  be 
interested  in  later  on.  Prior  to  each  one,  the  name  of  the  corresponding 
mixing  condition  for  stochastic  processes  is  given: 


Strong  mixing. 


(0.3)  ct(F,G):  *  sup|P(AnB)  -  P(A)P(B)|,  A  e  F,  BeG 

4>-mixing, 

(0.4)  4»(F,G) :  =  sup|P(B|A)  -  P(B)j,  A  e  F,  BeG,  P(A)>0 

(//-mixing, 

(0.S)  »(F,G):  =  supl-LCAnBl..  .  1|>  A  e  F,  BeG,  P(A)P(B)  >  0 

p-mixing, 

(0.6)  p(F,G) :  =  sup|Corr(X,Y)j,  XeL^F),  Ye^G), 

X  real,  Y  real 

Absolute  regularity  ("weak  Bernoulli"), 

I  J 

(0.7)  3(F,G) :  =  sup  *5  £  l  | P (A  nB  )  -  P(A  )P(B  )| 

i=l  j=l  3  3 


where  this  sup  is  taken  over  all  pairs  of  partitions  {Aj,...,  A^} 


and 


{B  , ...,  B ,}  of  ft  such  that  A.  e  F  for  all  i  and  B.  eG  for  all  j.  To 

1  J  1  J 

put  this  another  way,  0(F,G)  =  (i$)  •var(P^^  -  Pf  x  pq)  where  p^xq  is  the 
restriction  to  F  *G  of  the  measure  on  (1  *  ft  induced  by  P  and  the  diagonal 
mapping  uj  I*  (oj.ui)  and  P p  Pg  are  respectively  the  restrictions  to  F  and  G 
of  the  measure  P. 

(/-mixing  again  (see  [2J,  [3,  Theorem  l.l(ii)],  or  [4]), 

(0.8)  A(F.G) :  •  sup  A  e  F,  BeG, 

[P(AJP£B)J* 

P(A)P(B)  >0 


Some  simple  dominations  between  the  above  measures  of  dependence  are 
given  by  simple  inequalities: 

a(F,G)  <  0(F,G)  <  <}>(F ,G)  <  ip(F,G) 
u(F,G)  £  X(F,G)  s  p(F,G) 

The  domination 

P(F,G)  <  2*^(F,G) 
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is  well  known  (see  e.g.  [9,  p.  309,  Theorem  17.2.3J),  and  it  has  been  improved 
independently  by  Denker  and  Keller  [6,  p.  506,  line  2  and  p.  516,  line  -8]: 
p(F,G)  <  2»max{<J>(F,G) ,  4>(G,F)},  and  by  Peligrad  [13,  p.  462,  eqn.  (4)]: 

If  If 

p(F,G)  <  2*<j>  (F,GJ*4»^(G,F).  Another  domination,  taken  from  [3,  Theorem  1.1], 
is  as  follows: 

P(F,G)  s  OA(F,G)*[l  -  log  X  (F,G)  ] 
where  C  is  a  universal  constant. 

The  present  paper  continues  and  complements  [3],  Now  [3]  was  motivated 
by  Rosenblatt's  [15,  Chapter  7]  use  of  the  Riesz  convexity  (interpolation) 
theorem  to  compare  mixing  conditions  on  Markov  chains.  Also,  Lifshits 
[11,  Lemma  1]  used  the  Riesz-Thorin  interpolation  theorem  in  order  to 
establish  a  moment  inequality  involving  several  measures  of  dependence.  The 
idea  of  [3]  was  to  use  operator  theory  to  try  to  develop  a  unified  approach 
to  the  study  of  a  large  class  of  measures  of  dependence,  and  to  prove  "general" 
domination  results  for  this  class.  The  main  point  in  [3]  was  to  consider  a 
particular  bilinear  form: 

(0.9)  B(f,g) :  =  H(fg)  -  (Ef) (Eg) 

defined  for  simple  functions  f  and  g  such  that  f  is  F-measurable  and  g  is 
G-measurable  (where  F  and  G  are,  say,  given  c-fieldscM).  Various  norms  of 


the  bilinear  form  B  define  a  large  class  of  measures  of  dependence.  Fol¬ 
lowing  [3],  for  1  <p,q<°°,  for  given  o-fields  F  and  G  ,  we  shall  be 
interested  in  the  (p.q)-norm  of  B  on  F  *G,  namely 


(0.10)  R  (F,G) :  = 

r  *  M 


lii(fe)  -  u:f)(i:g)l 
IlfllplUllq 


f  simple,  complex- va lued ,  F-measurable 
g  simple,  complex- valued,  G-measurable 


Later  on,  we  shall  also  be  interested  in  a  version  of  (0.10)  using  r.v.'s 
f  and  g  taking  their  values  in  Banach  spaces.  We  shall  also  consider  for 
fixed  0  ^  r,s  <  1 , 


0.11)  a  (F,G) :  = 

r .  3 


I  ni'A/'.m 


which  is  a  variant  of  R^r  j (F ,G)  in  which  only  indicator  functions  are 

being  used.  In  the  terminology  of  [16],  a  is  simply  the  "restricted” 

r  ,s 

(l/r , 1/s) -norm  of  the  bilinear  form  B.  ‘ 

Of  the  six  measures  of  dependence  in  eqns.  (0.3) -(0.8)  above,  five  can 
immediately  be  fit  into  this  framework,  as  follows: 


a  =  a 


0,0’ 


4>  =  a 


1,0’ 


ip  =  a 


1.1* 


\h’ 


p  =  R 


2,2 


(The  last  equality  P  =  R2  2  follows  from  l18.  P*  512,  Theorem  1.1]  and  a 
trivial  calculation;  restriction  to  real,  mean-zero  r.v.'s  would  have  no 
effect  on  R^  q  in  the  special  case  p  = q  = 2) .  The  remaining  one,  namely  8 
in  eqn.  (0.7),  will  be  expressed  in  terms  of  (0.9)  using  B-valued  (Banach 
space  valued)  r.v.'s  in  Section  2.2  later  on. 

When  we  wish  to  consider  a  version  of  (0.10)  for  just  real-valued  r.v.' 
we  shall  use  the  notation 


E(fg)  -  (Ef)(Eg) 

imipiuiiq 

f  simple,  real-valued,  F-measurable 
g  simple,  real-valued,  G-measurable 

\  trivial  calculation  yields 

(0.13)  R  (F ,G;  1R)  <  R  (F,G)  <  4*R  (F,G;1R) 

p,q  p,q  p,q 


(0.12)  Rn  n(F,G;lR)  :  =  sup 
P  >  M 


(and  one  can  lower  the  4  somewhat).  When  we  don't  wish  to  mention  F  and  G 
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explicitly,  the  measure  of  dependence  R  (*,*;F) 

p,q 

as  R  (F)  . 
p,q 


will  be  written  simply 


In  eqns.  (0.10)  and  (0.12)  the  restriction  to  simple  functions  will  be 
convenient,  but  it  is  obviously  a  stronger  restriction  than  necessary. 

The  following  proposition  is  well  known  and  quite  elementary: 
Proposition  0.1.  If  1  sp,qs®,  and  F  and  G  are  o-fields,  then  the  fol¬ 
lowing  two  statements  hold: 


(1)  R  (F,<?)  - 

p,q  ll<llp 

f  simple,  complex-valued,  F-measurable 


(ii)  R  (F,G;F)  =  sup 
P>  M 


|lB(f|G)  -  Ef|l„. 

Ilf  It 


f  simple,  real -valued,  F-measurable 


Using  this  proposition,  some  results  in  Rosenblatt  [15,  Chapter  7]  can  be 
transcribed  into  our  language,  including  the  following  statement  from 
[15,  p.  211,  Theorem  1]:  The  measures  of  dependence  R^  t  (and  R^  p,(F)), 
l<p<oo>  are  equivalent. 

Remark  0.2.  There  is  a  nice  connection  between  Lorentz  spaces  and  the 

measures  of  dependence  a  ,.  In  this  remark  we  shall  summarize  some  key 

r ,  s 

points  of  this  connection;  the  reader  is  referred  to  e.g.  [1]  [8]  [19]  for 

the  details  on  Lorentz  spaces.  In  fact  a  can  be  considered  as  a  Lorentz 

r ,  s 

space  norm  of  the  bilinear  form  B  in  (0.9).  Recall  (see  e.g.  [1])  that  the 
Lorentz  space  Hp<®,  may  be  defined  as  the  completion  of  the  set  of 

simple  functions  with  the  norm  ||f||p  y  =  inf  ^  I  ai  I  "p  1^^  =  f } . 

This  definition  makes  sense  also  for  p  =  °°,  and  L  .  equals  L  with 

r  OO  ^  1  ^  OO 


an  "equivalent  norm"  (see  [1]).  If  1  s  p  <  °°,  then  the  dual  space  of  L 

equals  L  f0(#:  =  (f:  supt>Qt1//p  P(|f|  >  t)  <“}.  Let  p  =  1/r  and  q  =  1/s. 

Then  s  equals  the  norm  of  B  on  L .  j  *  or,  equivalently,  if  q<°°, 

the  norm  of  the  linear  operator  E(*|G)  -  E(*)  from  L^CF)  into  L^loo(G). 

It  is  obvious  that,  with  r = 1/p  and  s  = 1/q,  one  has  a  (F,G)  (F,3) 

r,s  p,q 

On  the  other  hand,  L  ,  =>  L  for  every  p,  >p,  and  thus 

P1  Pi  1 

a  (F,G)  i  OR  (F,G)  if  p  > 1/r  and  q.  >l/s.  (Here  the  constant  C 

r»5  Pl’"l  1  1 

depends  on  r.s.p^.q^.) 

Remark  0.3.  All  of  the  measures  of  dependence  d  that  are  studied  in  this 
paper  have,  in  addition  to  eqns.  (0.1)  and  (0.2),  three  other  nice  properties 

O)  d(  v  F  ,  v  G  )  =  lim  d(  v  F  ,  v  G  ) . 
n=l  n  n=l  n  N-*00  n=l  n  n=l  n 


(b)  The  measure  d  is  given  by  a  well  defined  formula  that  is  applicable 
on  any  probability  space. 

(c)  For  finite  a- fields  F  and  G  the  quantity  d(F,G)  does  not  depend  on 
the  particular  choice  of  F  and  G  or  on  the  underlying  probability  space 
(U,M,P),  but  depends  only  on  the  matrix  (T^nB^):  l<i<I,  l<j<J) 
where  , . . . ,  Aj  (resp.  B^,...,  B^)  are  (in  any  order)  the  atoms  of  F 
(resp.  G) .  Thus  (by  eqn.  (0.1))  measures  of  dependence  can  be  formulated  in 
terms  of  properties  of  doubly  stochastic  matrices.  Some  of  the  "domination" 


results  obtained  in  this  paper  may  be  closely  related  to  matrix  inequalities 
(of  which  there  is  already  a  huge  theory). 


Remark  0.4.  Keeping  in  mind  Remark  0.3(b)  for  the  measures  of  dependence 
that  we  study,  the  statement  "d1  fails  to  dominate  d2"  will  be  used  to  mean 
that  there  exists  a  probability  space  on  which  d^  fails  to  dominate  d2<  (It 
is  easily  seen  that  in  such  cases  d^  will  fail  to  dominate  d^  on  the 


probability  space  [0,1]  with  Lebesgue  measure,  or  on  any  other  atomless 


probability  space.)  To  prove  that  fails  to  dominate  d ^  it  suffices  to 
show  that  for  different  choices  of  pairs  of  o-fields  F  and  G,  on  different 


choices  of  probability  spaces,  one  can  have  dj(F,G)  arbitrarily  small 
without  d^CF.G)  being  (arbitrarily)  small  (for  the  same  F  and  G) .  For 
then  by  taking  the  product  of  countably  many  such  probability  spaces,  one 
can  obtain  a  single  probability  space  on  which  d^  fails  to  dominate  d 
(i.e.  on  which  there  are  o-fields  F^  and  G^,  n  =  1,2,3,...  such  that 
d,  (F  ,  G  )  ->-0  as  n-*-00  but  d.(F  ,  G  )  fails  to  converge  to  0). 

In  [3]  a  detailed  analysis  of  dominations  between  a  and  R  was 

r,s  p,q 

given  in  the  "lower  triangle",  r  +  ssl  resp.  1/p  +  l/q<l.  In  par¬ 
ticular  it  was  shown  that  among  these  measures  of  dependence  in  the  lower 
triangle  there  are  only  four  equivalence  classes.  Section  1  of  this  paper 
will  complement  that  analysis.  There,  in  Section  1.1  it  will  be  shown  that 
[3,  Theorem  3.6]  (one  of  the  main  tools  for  the  study  of  measures  of  depen¬ 
dence  in  [3])  is  within  a  multiplicative  constant  of  being  sharp.  In  Section 
1.2  it  will  be  shown  that  for  g  and  ^  in  the  "upper  triangle", 
r  +  s>l  resp.  1/p  +  l/q>l,  there  arc  no  equivalencies  except  for  the 
well  known  one  oij  j=Rj  j,  In  Section  1.3  a  short  proof  of  [3,  Theorem  2.2] 
is  given,  using  a  three-step  "reiteration"  technique.  Section  1.4  contains 
sharp  domination  results  for  the  case  where  one  of  the  two  o-fields  is 
finite  with  only  two  atoms,  each  having  probability 

Section  2  is  devoted  to  measures  of  dependence  based  on  B-valued  (Banach 
space  valued)  r.v.'s.  In  Section  2.1  a  preliminary  discussion  is  given.  In 


Section  2.2  the  dependence  coefficient  8  (in  eqn.  (0.7))  will  be  fit  into 
this  B-valued  scheme,  and  it  will  also  be  shown  that  if  p>l  and  q>l 


then  R  fails  to  dominate  8. 

p.q 

tion  made  in  [3,  Theorem  4.2]. 


In  Section  2.3  we  drop  an  unnecessary  assump- 
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SECTION  j_:  THE  SCALAR  CASE 

In  Section  1  here  we  study  measures  of  dependence  based  on  random  variables 

taking  scalar  (i.e.  real  or  complex)  values,  namely  the  measures  of  dependence 

R  (F,G)  and  R  (F,G;F)  from  eqns.  (0.10)  and  (0.12).  Because  of  eqn. 

P  j  ^  P  *  ^ 

(0.13)  we  really  only  need  to  discuss  one  of  these  two  (families  of)  measures; 

following  [3]  we  shall  discuss  R^  ^(F,G)  (which  uses  complex  r.v.'s).  Of 

course  the  measures  a  will  also  be  part  of  this  discussion,  which  is  in- 

r ,  s 

tended  to  complement  [3]. 


SECTION  1.1:  SHARPNESS  OF  A  DOMINATION  RESULT 


Section  1.1  here  is  the  only  part  of  this  paper  in  which  measures  of 
dependence  between  more  than  two  a-fields  will  be  included.  In  [3],  Theorem 
3.6  was  a  key  tool  for  comparing  some  measures  of  dependence  between  two  or 
more  a-fields.  Here,  for  any  given  choice  of  parameters  meeting  the  specifi¬ 
cations  in  (3,  Theorem  3.6J,  we  shall  show  that  [3,  Theorem  3.6J  is  within  a 
constant  factor  of  being  sharp.  The  sharpness  (in  the  same  sense)  of  (3,  Theo¬ 
rem  2.1]  follows  as  an  indirect  consequence.  (Otherwise  a  sharper  version  of 
Theorem  2.1  would  lead  to  a  sharper  version  of  Theorem  3.6  by  the  same  proof, 
and  this  would  contradict  the  example  given  below.)  The  nature  of  the  example 
given  below  is  such  that  it  also  confirms  the  sharpness  (in  the  same  sense)  of 
[ 3,  Theorem  4 . 1 (vi) ] . 

Before  stating  the  result,  let  us  recall  some  terminology  from  [3].  Sup¬ 
pose  (ft,M,P)  is  a  probability  space,  and  F^ ,  F^,...,  F^  are  a-fields  cM. 
Suppose  B:  S(F^)  *...*  S(Fn)  -*■  I  is  an  n-linear  form,  where  S(F)  denotes 
the  set  of  (equivalence  classes  of)  F-measurable  complex-valued  simple  func- 

t  ions .  Suppose  £:  =  (p  , . .  . ,  p  )  where  1  <  p,  <  °°  for  all  k  =  1 , . . . ,  n. 

in  k 

Then  we  define  (as  in  [3]), 
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(1.1.1) 


(1.1.2) 


VB) : 


|B(f1.....  fn)| 

I  - — 


fk  eS(Fk)  for  all  k  =  1,. 


|B(I.  .....  I.  )| 
sup  1 _  n 


H'a  A  •••••«» 


A  "p 
n  rn 


e  for  all  k  =  1 , . . . ,  n 

Theorem  1.1.1.  Suppose  n^2  is  an  integer  and  l^pk<°°,  k  =  l,...,  n. 
Define  £:  =  (p  , . . . ,  pn) .  Then  there  exists  a  constant  C  depending  only  on 
£  such  that  the  following  holds: 

For  each  t,  0<t<l,  there  exists  a  probability  space  (fi,M,P)  and 

o-fields  F,,...,  F  c|l|  and  an  n-linear  form  B:  L  (F.)  x...*  L  (F  )  -*•  I 
I  n  00  1  00  n 

(namely  BCf^,...,  fn) :  *  E^  *...•  ffl)  -  Il”_1  Efk)  such  that  d^(B)<t 
and  ||B||  >Ct(l  -  logt)C  where  c:  =  E,  „  1/p'  with  K:  =  (k :  l<p,  <«}. 


The  constant  exponent  c  here  also  depends  only  on  p  and  is  exactly  the 
same  as  in  [3,  Theorem  3.6].  (Thus  (3,  Theorem  3.6]  is  within  a  constant 
factor  of  being  sharp.  See  also  the  Addendum  on  p.  52  of  this  report.) 

Proof.  If  €  (1,°°}  for  all  k*l,...,  n  then  the  theorem  becomes 
trivial.  So  we  assume  that  1  < Pk  < 00  f°r  at  least  one  k. 

For  each  p,  1  <  p  < 00 ,  and  each  v,  0<vz%,  define  the  function 
Gy  p:  [0 ,1] -*■  [0,1]  as  follows: 

Gy  p^x^:  =  for  0  s  x  s  h 

1:V,P(X);  •  Cv,p(1  •x)  for  185x51 

Note  that  G  is  concave  and  increasing  on  [0 ,*5]  and  hence  G  is  concave  on 
[0,1]. 


■ 

.•V.V  V  v-\ 
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■SUJWJ, W  P  *.'  ff L'_ 95  *? -■.' w  1  wz * 7  •  ?  ■ » ■  «r » ■  ■■  v  ■.■  -.■  r 


For  l<p<»  and  0  <  v  s  %  define  the  function  2  :  [0,1]  ■*■  [-1,1] 

^  *v,P(X) ;  ■  ST  Gv,plx>  •  'n'ltls- 

1  if  0  <  x  <  v15 
v(l/p)*x-1/p  if  vp  <x<h 
-v»(l/p)*(l  -  x)"1//p  if  *5  <  x  <  1  -  v** 

-1  if  1  -v**’  <x<  1 

(gv  p  is  not  defined  at  xs0,  vp  ,  15,  1  ,  1.)  Then  gy  p  is  non- increasing, 

Jjgy  p(x)dx  =  0,  and  |gy  p(x)|  si  for  all  x  at  which  gy  p(x)  is  defined. 

If  0  <  v  s  %  and  p  =  l  or  °°,  define  the  function 
gv  n=  [0,1]  -*■  [-1.1]  by 

(1.1.4) 

1  if  0<x<% 

-1  if  %  <  x  <  1 

In  this  case  too,  g  is  non-increasing,  J*  g  (x)dx  =  0,  and  |g  (x)  |  si. 

V,p  u  V  ,p  "  »r 

The  following  integral  will  be  used  later  on.  If  0  <  v  <  *5  and  l<p<°°, 

then 

(1.1.5)  jf  |g„  (x)|P'dx  =  vP  *  [  1  +  (l/p)P  (logJs  -  p'  log  v)  ] 

0  ,p 

Now  let  us  get  to  the  main  part  of  the  argument. 

Henceforth  m  denotes  Lebesgue  measure  on  [0,1J. 

Let  t,  0<t<  (%)n  be  arbitrary  but  fixed.  (There  is  no  loss  of 
generality  in  specifying  0<t<0s)n  rather  than  0<t<l  as  in  the  statement 
of  Theorem  1.1.1.) 

Define  v,  0<v<Jj,  by 

(1.1.6)  vc»rd{k:  I  <PCkJ  <-}  .  t 


*v)PW:  =  1 
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V. 

k 


Define  the  probability  r/>ace  (ft,M,P)  as  follows: 
ft:  =  [0,l]n:  *  10,1]  *  [0,1]  [0,1].  M  is  the  a-field  of  Borel  subsets 

of  ft.  P  is  defined  by 

n  n 


P(A  x...x  A  )  =  n  m(A.)  +  n  /.  g  (x)dx 
1  n  k=l  K  k=l  \  v‘pk 


for  all  Borel  subsets  Aj,...,  Ac  [0,1].  (Recall  the  inequality  |g  (x) |  si 

n  v,Pk 

mentioned  above.) 

Note  that  since  J^g  (x)  =0,  each  of  the  marginal  distributions  of  p 
u  v ,p 

is  uniform  on  [0,1]. 

t’H 

For  each  k  =  l,...,  n  let  be  the  a-field  generated  by  the  k  coor¬ 
dinate  in  ft.  Define  the  n-linear  form  B:  L  (F,)  x.,,x  L  (F  )  +  (  as  follows: 

oo v  OO v  l\y 

B<fl . £n):  *  E<fl  •••••  V  - 

Proof  that  d^(B)  5  t: 

Suppose  D^eF^,  k  =  l,...,  n  with  P(Dk)  t  0.  For  each  k,  represent 

Dk  by  Dk:  =  [0,1]  x...x  [o,l]  x  br  x  [0,1]  x...x  [0,1]  (where  the  kth 

coordinate-set  Bk  is  a  Borel  subset  of  [0,1]).  Then 

lB(lD  »•••»  ^0  M  ~  P  l/g  p  (x)dxl 

U1  n  k=l  Dk  v,pk 


n  m(B,  ) 

5  "  f/0  8V  p  Cx)dx] 
k=l  ,pk 


”(V 


l  "  ,Gv  „  WV)H.  11  !/„  K  8V  D  CxJ dx|  1 

(k:l<p,  <°°)  ,pk  (k  :  p.  =1  or°°)  ,pk 


s  [  n  v(m(B.))1/pk]*[  11  m(Bk) ]  •  [  IT  1] 


(k:l  <pv<°°} 


{k:pk  = 1} 


(k:pk  =«} 


'■'s'. 


Cv‘. S'S  ^  'Sv4 -V-1>VV-  >V- .  V-  V  >!•' ->1-  -> 


(card{k:l<p(k)  <•}.  „||,  || 


"  n  "d 
k=l  uk  pk 


-  »•  n  ||i  || 

1 _ 1  ^1. 


k=l  wk  Kk 

by  eqns.  (1.1.3),  (1.1.4),  and  (1.1.6)  and  the  fact  that  for  each  0<v<*s, 

l£p£°°  the  function  g  is  non-increasing  (as  noted  earlier)  and  odd- 

v,P 

symmetric.  Thus  d^(B)  St. 

Proof  that  ||B||^2Ct(l  -  logt)c  (for  some  constant  C  depending  only  on 

£) .  (Here  c  is  as  in  the  statement  of  Theorem  1.1.1.) 

Define  the  r.v.'s  f , , . . . ,  f  as  follows: 

1  n 


V*! . xn)  ;=  ®v,pk(V  lf  pk  = 


=  1  or  «> 


.p^k 


fk(xl .  xn>  'si«"  *v.p ><xk)1’lsV,p1M  if  1<Pk< 


To  shorten  the  notation  below,  we  write  fk(x)  instead  of 

(Xj  » -  •  - ,  xk  l,x,xk+i , . . . ,  xn).  Note  that  for  each  k,  ffc  e  L<jo(Fk) 


B(f, , • • . ,  f  )  =  n  /  f k ( x) g  (x) dx 

n  k=l  0  ,pk 

For  each  k  such  that  p,  = 1  or  “,  /:!.f,  (x)g  (x)dx  =  1  and 

K  U  K  V  ,p. 


I  fk  '1 1  ^fk"<»  =  ** 


For  each  k  such  that  1  <  p.  <“,  jJ^Wg^  (x)dx  =  2*/*?|  g  (x)|Pk  dx 

*  UK  v  ,  U  V , 

aad  ||t  |l  -  „  (*)|Pk  dx]1/Pk. 


0  v,Pk 


Hence 


B(f,,...,  f  )  =  [  n  21_1/Pk[/|g  (x)|pk  dx]1“1/Pk] 
1  n  {k :  1  <  p,  <  °°}  0  v,pk 


For  each  k  such  that  1  <  < «, 

jflfL,  _  (x)|Pi  dx  *  C  *vPk  (i  -  log  t 
n  »*Fv  K 


by  eqn.  (1.1.5),  where  Ck  :*  min{l  +  (l/pk)Pk  loglj,  (1/n) - Cl/pk) ). 
(Note  that  Ck  is  positive  and  depends  only  on  £.) 

.  n 

Hence  by  eqn.  (1.1.6) ,  B(f  , . . . ,  f  )  z  C*t»(l  -  log  t )  •  II  ||f  || 

n  1/Pi  k=1  ?k 

where  c  is  as  in  Theorem  1.1.1  and  C  :=  II  Cr  k 

{k :  1  <  pk<  <*} 

and  depends  only  on  £) . 


k  (which  is  positive 


Now 


trivially  the  supremum  of  |B(flf...,  fn)  I  /  ( II  fj  II  * . . .  •  ||  fj|  )  is 


the  same  over  (f. , . . . ,  f  )  e  S(F,  )*...*  S(F  )  as  over 
l  n  l  n 

^(Fj)  x...x  Loo(Fn) .  Hence  ||b||  *  C*t*(l  -  log  t)c.  This  completes  the  proof. 


Remark  1,1.2.  [3,  Theorem  3.6]  holds  for  n  =  l  as  well  as  for  n£2,  by 

a  simple  argument  similar  to  the  proof  of  [3,  Theorem  2.1  or  2.2].  Also, 

[3,  Theorem  3.6]  is  within  a  constant  of  being  sharp  for  n  = 1 ,  as  one  can  show 
with  a  slight  modification  of  the  above  example. 


SECTION  1.2:  DOMINATIONS  IN  THE  UPPER  TRIANGLE 


The  following  definition  will  be  needed:  For  any  point  (r,s)  such  that 
0<r,ssl,  and  r  +  s>l,  define  Q(r,s)  to  be  the  closed  (convex)  quadri¬ 
lateral  region  with  vertices  (0,0),  (1,0),  (r,s),  and  (0,1). 


Remark  1.2.1.  Statements  (a) -(g)  below  give  a  complete  picture  of  what 

dominations  (or  equivalencies)  exist  for  any  two  of  the  various  measures  of 

dependence  a  and  R  with  0sr,s<l  and  lsp,qs®.  This  whole 
r ,  s  P » q 

picture  can  be  pieced  together  from  (3,  Theorem  4.1  and  Remark  4.1]  and  Pro¬ 
positions  1.2. 2-1. 2. 6  given  below;  to  do  so,  one  should  keep  in  mind  the 
trivial  facts  that  for  any  two  o-fields  F  and  G,  a  (F,G)  <  R  . 


w.(f.S) 
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and  g(F,G)  and  jys(F,G)  are  each  non-decreasing  as  r  or  s  increases. 

(In  particular,  the  problem  mentioned  in  the  last  sentence  of  [3,  Remark  4.1] 
is  solved.)  The  restrictions  0<r,s<l  and  l<p,q<»  are  to  be  implicitly 
understood. 


(a)  The  measures  of  dependence  a r  g,  r  +  s  < 1,  and  ^ ,  l/p  +  1/q  < 1, 

are  equivalent;  and  they  do  not  dominate  any  of  the  other  measures  of  dependence 

a  and  R 
r,s  p,q 

(b)  The  measures  of  dependence  a  .  ,  0<r<l,  and  R  ,,  l<p<°°  , 

r, l -r  P  >P 

are  equivalent;  and  they  dominate  o  ,  r  +  s<l,  and  R  ,  1/p  +  l/q  <  1 ; 

r,s  P,q 

but  they  do  not  dominate  any  of  the  other  measures  of  dependence  a  and 

r  •  s 


p>q 

(c)  (Sj  0  and  Rj  ^  are  equivalent.  aQ  l  and  R^  ^  are  equivalent. 
Neither  a.  _  dominates  a  . ,  nor  vice  versa,  a  and  a  each  dominate  the 

1  >  v  U  j  1  1  |U  U  ^  i 

measures  of  dependence  ars»  r  +  s<l,  Rpq*  1/p  +  l/q  <1,  ar  !  r»  0<r<l, 

and  R  , ,  1  <  p  <  °°.  Neither  a,  _  nor  aft  ,  dominates  any  of  the  measures  of 

P.P  1.0  0.1 

dependence  a  ,  r  +  s  > 1 ,  or  R  ,  1/p  +  l/q  > 1 . 

r,s  p«  R 

(d)  ai  i  =  Rj  and  these  measures  of  dependence  dominate  all  of  the 

other  ones  a  and  R 

r.s  p.q 

(c)  If  0<r  ,s_  <1  and  rn  +  s_>l,  then  a  dominates  a  and 

0  0  00*  r0’S0  r,s 

R , /  for  all  (r.s)  e Q(r  ,s  ),  except  that  a  fails  to  dominate  a.  , 

u  u  »Sq  l,u 

Y-'  "o.l-  V 1-  or  Rl/r0,l/s„-  For  such  (VS0>>  Rr„,s„  dol"inates  \,s  a"d 


0 


Rl/r,l/s  for  a11  lr,s)  e(Hr0*s0)>  excePt  that  Rr  ,s 


O’  0 

fails  to  dominate  a 


1,0’ 


R.  .  a  - ,  or  R  .  Also  for  such  (rn,sn),  neither  a  nor  R,  .  .  , 

I,®  o.l’  oo,i  0’  0"  r0’s0  l/rn,l/s, 

dominates  a  or  R1/r>1/s  for  any  (r,s)  /Q(rQ,s0). 

(f)  If  0  < sp  < 1 ,  then  all  of  Remark  (e)  holds  verbatim  for  r^  = 1 
(i.e.  for  the  point  (rQ,s0)  »(1,Sq)),  except  that  g  (and  hence  R1  ^g  ) 


dominates  a  (and  hence  also  R  ). 

1  ,  U  A 
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(g)  If  0<rg<l,  then  all  of  Remark  (e)  holds  verbatim  for  sQ  =  1 
(i.e.  for  the  point  (rn,sn)  =  (r.,1)),  except  that  a  (and  hence  R,  .  .) 

U  U  U  1/Tq,1 

dominates  a.  .  (and  hence  also  R  , ) . 

0,1  «,1' 

The  portion  of  this  picture  involving  only  g,  r  +  ssi,  and  R^  , 
1/p+l/q  *  1,  was  summarized  in  [3,  Theorem  4.1  and  Remark  4.1].  (Some  of 
that  portion  was  already  previously  known.)  The  equation  a  = R  is  well 

i  ,  I 

known  and  elementary.  What  we  have  to  do  here  in  Section  1.2  is  focus  on  the 

measures  of  dependence  a  ,  l<r  +  s<2,  and  R  ,  1  <  1/p  +  1/q  <  2.  For 

s  p  »q 

these  particular  measures  of  dependence  the  (positive)  domination  results  men¬ 
tioned  in  Remark  1.2.1(e) (f) (g)  follow  immediately  from  interpolation  theory. 
Since  our  (probability)  terminology  is  different  from  the  usual  terminology  of 
interpolation  theory,  we  shall  provide  some  of  the  details  for  the  reader's 
convenience,  in  the  form  of  Propositions  1.2. 2-1. 2. 3  below.  Then  in  Proposi¬ 
tions  1.2. 4- 1.2. 6  below,  we  shall  give  counterexamples  to  show  that  no  other 
dominations  occur  for  these  measures  of  dependence. 


Proposition  1.2.2.  Suppose  0  s  Tq.Sq.Tj ,Sj  <,  1 .  Suppose  that  each  of  the 
following  two  statements  (i)  and  (ii)  holds: 

(i)  Either  rQ  =  Tj  *  1 ,  or  rQ  t  ^  . 

(ii)  Either  s0  =  Sj=l,  or  Sq/Sj. 

Suppose  0  <  0  <  1 .  Define  r  and  s  by  r  :=  (1 -0)rQ +0^  and  s  :=  (l-e^+eSj. 
Suppose  that  the  equation  r  +  s  >1  is  satisfied.  Then  for  some  constant 
C  =  C(r0,rj .SqjSj ,0) ,  one  has  that  for  every  pair  of  o-fields  F  and  G, 


Vr.l/s^  5  c*l° 


r  (F,G)]1_e*la 

o’  o  r  i 


(F,G)f 


In  the  case  rQ  i  rj  and  s^Sj,  Proposition  1.2.2  is  an  application  of 
[19,  Theorem  2.9]  (and  Remark  0.2).  (The  bilinear  form  B  in  eqn.  (0.9)  can  be 


i 
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seen  as  a  bilinear  operator  into  Lp (fJ ^  where  ^  is  trivial,  consisting 

of  just  a  single  point;  with  this  interpretation  B  "fits”  the  hypothesis  of 
[19,  Theorem  2,9].)  In  the  case  r^  =  r^  =  1  and  s^^Sj,  one  can  take  r^  and 
Tj  out  of  the  picture  by  an  application  of,  say,  [3,  Lemma  3.7(i)],  and  then 
apply  [19,  Theorem  2.9]  in  an  appropriate  way  to  deal  with  sQ,  s1  and  s.  The 
other  cases  in  Proposition  1.2.2  are  then  obvious. 

Proposition  1.2.3. 

(i)  If  0  <  r_,srt  <  1  and  r.  +  s.  >  1,  then  a  dominates  R,  .  .  .  for 

O’  0  00*  r0*s0  1/r.l/s 

every  (r,s)  e Q(rQ,s0)  -  { 0  ,0) , (rQ,s0) ,  (0 ,1)  } . 

(ii)  If  0  < s„  <  1  then  a.  dominates  R,  ,  ,,  for  every 

0  l,sQ  l/r,l/s  7 

(r,s)  e  Q(  1  ,sQ)  -  {(  1 ,sQ) , (0,1)}. 

(iii)  If  0  <  r  <  1  then  a  ,  dominates  R, .  for  every 

0  rQ,l  l/r,l/s  7 

(r,s)  £Q(rfll  1)  -  {(1,0), (rQ,  1)}. 

Proof.  The  proofs  of  all  three  parts  are  similar,  so  we  shall  only  prove 

(i) .  Let  Sj  denote  the  line  segment  with  endpoints  (rQ,s0)  and  (1,0),  and  let 

S2  denote  the  line  segment  with  endpoints  (rQ,s0)  and  (0,1).  Recall  that 

(trivially)  a,  .51  and  a_  <  1 .  By  Proposition  1.2.2,  a  dominates 

1»°  0,1  r0’S0 
Rl/r  1/s  for  every  (r,s)  c  [8^82]  -  { (rQ ,sQ)  ,  (1 ,0)  ,  (0 , 1)  } .  Also,  trivially, 

a  dominates  n  and  hence  also  measures  R,  .  ,  r  +  s<l  (which  are 

rQ,s0  0,0  1/r.l/s’ 

equivalent  to  aQ  0  as  noted  in  [3,  Remark  4.1]).  Finally,  the  remaining  points 
(r,s)  in  the  interior  of  Q(r0,sQ)  each  lie  on  some  line  segment  with  one  end¬ 
point  on  [SjuSj]  -  { (rQ,s0) ,  (1 ,0) ,  (0,1)  }  and  the  other  in  {(r,s):  r  +  s<l}, 
and  hence  by  the  multilinear  Thorin  interpolation  theorem  (see  e.g.  [1,  p.  18, 

Exercise  1 3 ] )  a  dominates  R. .  ..  for  every  such  point  (r,s). 

r0’  0 

Now  we  are  ready  for  the  construction  of  counterexamples.  A  few  trivial 

facts  are  worth  keeping  in  mind.  The  quantity  |P(AnB)  -  P(A)P(B) j  remains 

c  c 

unchanged  if  A  is  replaced  by  A  ,  or  B  by  B  ;  consequently,  one  always  has 
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ar  .(F.6) 


sup  |P(AnB)  -  P(A) PfB) | 
[P(A)]r[P(B)]S 

A  e  F,  B  e  G, 

P(A)  <  %,  P(B)  <  % 


Also,  if  F  and  G  are  finite  o-fields,  each  having  exactly  two  atoms,  then 
Rl/r  i/s(F’G)  5  4af  S(F,G)  by  a  trivial  argument. 

Proposition  1.2.4.  The  following  two  statements  hold: 


(i) 

If 

1  <ps® 

then 

R  i 
P.l 

does  not 

dominate  a  _. 

i  ,  u 

(ii) 

If 

1  <  q  S  oo 

then 

R, 

i.q 

does  not 

dominate  a 

U  ,  1 

Proof.  By  symmetry  it  suffices  to  prove  (i).  For  each  e,  0<e<*5, 
there  exists  a  probability  space  and  a  pair  of  finite  a-fields 
F  =  {ft,A,Ac,<J>}  and  G  =  {ft,B,Bc,<j>}  such  that  P(AnB)  =  P(A)  =  e  and 
P(B)  =  Jj;  and  by  a  direct  calculation,  Rp  j (F ,G)  s  40^  jfF.G)  =  4e1/^> 
and  Q(F,G)  =>  Statement  (i)  follows,  and  this  completes  the  proof. 


Proposition  1.2.5.  Suppose  0  <  rg.Sg.r.s  < 1 ,  r0+sQ>l,  r  +  s>l,  and 

(r,s)  i  Q(r  ,s  ).  Then  R  .  does  not  dominate  a 

u  u  tq, a/Sq  r ,s 

Proof.  Let  ax+by  =  c  be  an  equation  of  a  line  containing  (r^s^)  and 
one  of  the  points  (0,1)  or  (1,0),  such  that  the  points  (0,0)  and  (r,s)  are 
in  opposite  half-planes  determined  by  that  line.  By  the  assumptions  in  Pro¬ 
position  1.2.5,  we  can  (and  do)  take  a,b,  and  c  all  positive.  Thus  ar+bs>c 
Also  (since  r0  +  sQ>l)  we  have  that  c=max{a,b}.  Define  e>0  by  the 
equation  ar  +  bs  =  c  +  e. 

For  each  n  sufficiently  large,  there  exists  a  probability  space  and  a 

pair  of  finite  o-fields  F  =  {£2, A,AC ,$}  and  G  =  (Q,B,Bc,<f)}  such  that 

P(A)  =  n  as*s,  P(B)  *  n  and  P(AnB)  =  na^+nC£.  For  such  an  n 

it  can  easily  be  checked  that  R.  ,  ..  (F,G)  <  4a.  ,  ..  (F,G)  =  4n'e 

i/rn,i/sn  l/rn,i/sn 
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and  that  a  (F ,G)  =  1.  Proposition  1.2.5  follows, 
r ,  s 

Proposition  1.2.6.  Suppose  that  0<r,s«l,  r  +  s>l,  and  either  r<l 

or  s  < 1 .  Then  a  does  not  dominate  R,  .  ,  .  . 

r,s  l/r,l/s 

Proof.  By  symmetry,  without  loss  of  generality  we  can  (and  do)  assume 
that  r<l.  (So  we  allow  the  possibility  s  =  l.) 

(In  what  follows,  the  symbols  tt,  0,  and  ip  do  not  take  their  usual 
meanings.) 

Define  the  probability  space  (ft,M,P)  as  follows:  ft  :=  [0,1]*  {0,1} 

(the  union  of  two  disjoint  intervals) ;  M  is  the  o-field  of  Borel  subsets  of 
ft,  and  P  is  defined  by 

(1.2.1)  P(Ax{0})  :=  J<)>(x)dx  and  P(A*{l})  :=  /(I  -  <$>(x))dx 

A  A 

for  every  Borel  subset  Ac  [0,1  J,  where 


(1.2.2) 


<J>(x)  :  = 


tt  *  I p(x)  if  x  e  [0,%] 


TT  - 


i^(u)du  if  x  e  (*s,l] 


where  0  <  tt  <  and 


r- 1 

ex  if  a  <  x  <  b 


0  otherwise 


where 


(log  1/tt) 
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Note  that  by  our  assumption  r  +  s>l  and  simple  arithmetic,  we  have  that 
0<a<bi%.  Also,  for  tt  sufficiently  small,  one  has  that  2*/^jj(x)dx  s  tt 
and  ip(x)  for  all  x  c  [0,%] ,  and  hence  0  s  $(x)  £  1  for  all  x  e  [0,1] . 

Consequently,  for  ir  sufficiently  small,  eqn.  (1.2.1)  does  indeed  define  a 
probability  measure.  (We  restrict  tt  to  such  small  values.) 

Define  the  "marginal"  a-fields  F  and  G  by 

F  :=  {Ax{0,l}:  Ac  [0,1]  Borel  set) 

G  :=  {[0,1]  *B:  B  =  {0,1>,  {0},  {l}, 

Define  the  event  B^  :=  [0,1]  *{0).  Note  that  the  marginal  of  P  on  [0,1] 

is  Lebesgue  measure,  and  that  P(Bq)  =  v  <  . 

We  shall  first  get  an  upper  bound  on  a  (F,G).  First  a  preliminary  cal- 

r ,  s 

r- 1 

culation  will  be  handy.  The  function  ex  is  non-increasing  on  (0,°°).  Also, 

4> (x)  -  tt  is  non-negative  if  0<xs*s,  and  negative  if  instead  %<x£l.  Hence, 
letting  m  denote  Lebesgue  measure,  we  have  that  for  every  Borel  subset  Ac  [0,1], 


J|(j>(x)  -  ir]dx  s 
A 


/  ip(x)dx  s  j  exr-1dx 
An[0,*j]  An[0  ,*s] 


m(An  f0,JsJ) 


jo,J 

0 


_  r-1  . 
ex  dx 


=  (e/r)*[m(An[0,5s])]r  <  (e/r)  •  [m(A)  ]r 


and 


h 

J|4>(x)  -  ir]dx  2  -m(An(Js,l])  *2*/ip(x)dx 
A  0 


I 

* 


•  -(e/r)*m(An(Js,l]).(J5)r'1 

2  -(e/r)  •m(An0i,l])*[m(AnPs,l])]r‘1 

*  - (e/r) • lm(A) ]r 


| 


and  hence  |/A[<J>(x)  -ir]dx|  s  (e/r) • [m(A) ]r. 
Consequently,  it  is  easy  to  see  that 


“r,s(F'G) 


ip<yy  -  nycyi 

lP(A0)]r-|P(B0)JS 


sup 


|/A$(x)dx  -  [m(A)]*tr 
[m(A)  ]r*irs 


Ac  [0,1]  Borel  set 


(e/r) » [ni(A)  |r 
[m(A)  ]r»us 


Ac  [0,1]  Borel  set 


1 _ 

r* (log  l/TT)1_r 


Note  that  a  (F,G)  becomes  arbitrarily  small  for  it  sufficiently  small.  (We 
are  using  our  assumption  r<l  here.) 

Now  we  only  need  to  show  that  R^r  ^(F.G)  fails  to  become  small  with  it 
By  Proposition  0.1  and  symmetry,  we  have  for  p  :=  1/r  and  q  :=  1/s  (so  that 
1/p'  =  1  -  r), 


l|E(g|F)  -  Eg j|  . 
R  (F,G)  =  sup  - E. 

p,q  iuil. 


g  complex- valued,  simple,  G-measurable 


I|e(Ib0|F)-eiBo||  , 
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L/q  |  4>  C.x)  -  TTjP'dx]  1/P’ 
IT 

>  - 2 - 

S 

7T 

r  rb ,  r-l.p'  ,1-r 
f /a(e-x  )F  dx] 


, rb  -1  ,  .  1-r 
e*[Jax  dxj 

TTi 


- i - r-  (logb/a)1_r 

(log  l/TT)i_r 


1 

(log  1/TT)  1_T 


[(r=T  -  iffHiog  TV)]1'1* 


l-Sxl-r 
r  ' 


Since  r  +  s>l,  it  follows  from  elementary  arithmetic  that  — - — is  a 

1-r  r 

positive  constant.  Hence  ^(F,G)  fails  to  converge  to  0  as  tt  -*•  0.  This 
completes  the  proof. 


SfCTlON  1,3:  APPLICATION  OF  THE  REITERATION  PROCEDURE 

Here  we  give  another  proof  of  [3,  Theorem  2.2].  The  proof  is  rather  simple 
but  requires  more  interpolation  theory,  and  seems  to  be  harder  to  generalize  to 
the  multidimensional  case  as  in  [3,  Theorem  2 . 1 J .  (Such  a  generalization  of  the 
proof  might  be  possible  if  one  uses  as  a  tool  a  version  of  Zafran's  [19,  Theo¬ 
rem  2.9]  multilinear  Marcinkiewicz  interpolation  theorem  with  an  explicit  upper 
bound  on  the  constant.)  We  shall  restate  [3,  Theorem  2.2]  in  a  somewhat  loose 
form  (applicable  to  some  other  contexts  besides  probability  spaces);  the  main 
emphasis  is  on  the  proof. 


Theorem  1.3.1.  Suppose  T  is  a  linear  operator,  1  < p  < 00 ,  and 


(1. 3.1)  T:  with  norm  s  1 , 

(1.3.2)  T:  L  -*■  L  with  normal, 

(1.3.3)  T:  L  -*■  L  with  norm  <  e  <  1. 

p  p,o° 

Then 

(1.3.4)  T:  -*  with  norm  5  Ce(l  +  log  1/e)  *^p 


where  C  depends  only  on  p. 


Note  that  eqns.  (1.3.1)  and  (1.3.2)  imply  (1.3.3)  and  (1.3.4)  for  e  =  l  by 


standard  interpolation  theorems  (see  [1]).  The  point  of  Theorem  1.3.1  is  that  a 
small  norm  in  eqn.  (1.3.3)  forces  the  norm  in  eqn.  (1.3.4)  to  be  almost  as  small, 
i.e.  within  a  log  term.  (For  the  definition  of  the  Lorentz  space  L  ,  see  e.g. 
LI]  or  [19].) 


Proof.  Throughout  this  proof,  the  letter  C  is  used  only  for  multiplicative 
constants  which  depend  only  on  p.  The  value  of  C  may  vary  from  one  appearance 
to  the  next. 

The  trivial  cases  e  =  l  or  0  can  be  omitted.  Moreover,  to  cover  the  re¬ 
maining  cases,  without  loss  of  generality  we  can  restrict  e  to  small  values. 

-2 

Henceforth,  we  impose  the  condition  e<e 

Let  6  =  -l/(loge).  Then  0<6slj.  Define  pQ  and  p^^  by 
l/p0  =  (1-6) /p  +  6/1  and  1/p^  =  (1-6) /p  +  6/®.  Then  1  <  pQ  <  p  <  p^  <  °°. 

Now  we  apply  the  Marcinkiewicz  interpolation  theorem  twice,  each  time  with  an 
explicit  upper  bound  on  the  constant  in  that  theorem;  see  e.g.  [20,  Chapter  12, 


Theorem  (4.6)  and  eqn.  (4.2.1)].  In  that  way,  by  eqns.  (1.3.1)  and  (1.3.3)  we 
obtain  T:  L  -*•  L  with  norm  s  C(l/6)  ^  and  by  eqns.  (1.3.2)  and  (1.3.3) 

po  po  1/p  l  6 

we  obtain  T:  L  L  with  normsC(l/6)  .  Let  0  =  1  -  1/p.  Then 

pl  pi 

0  <  Q  <  1  and  1/p  =  (1  -  8)/Pq  + 0/p j •  By  applying  the  Riesz  (or,  in  the  complex 
case,  Riesz-Thorin)  interpolation  theorem  (see  [1])  we  have  that  T:  L  -*■  L 
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with  norm  s  C*  (1/6)  ^pe1_^=  C(-log  e)  ^p*e»e-<^  s  C(-log  e)  *^p  »e  (since  e~^=e). 
This  completes  the  proof. 


SECTION  1.4:  A  SHARP  DOMINATION  INEQUALITY 


Here  our  aim  is  to  find  a  sharp  upper  bound  on  R 

P»P 

some  cases  where  one  of  the  two  o-fields  has  two  atoms. 


based  on  a,  .  .  ,  in 

1/p.l/p' 

This  complements  Section 


4.4  of  [3]. 


Theorem  1.4.1.  Suppose  F  and  G  are  a- fields,  with  G  being  of  the  form 
G-  where  P(BQ)*ls  and  B^  =  B^.  If  l<p<«  then 

Rpip,(F,G)  S  2a1/p  l/p,(F,G).(l.(i)P'p'  1»8(2'>,/p,1/p.(f.«»)1/P'- 


Further,  this  inequality  is  sharp;  for  each  ou  [0,%]  there  exists  a  proba¬ 
bility  space  with  o-fields  F  and  G  =  {QjBq.B^ ,<f>}  where  P(BQ)  =h  and 
Bj  =  Bq,  such  that  a1/p>1/p,  (F»G)  *  a  and  Rp  pl  (F,G)  =  2a[l  -  (I)p'p' log(2a)  ]1/p'. 


In  particular,  referring  to  eqns.  (0.6)  and  (0.8), 
p(F,G)  s  2A(F,G)-[1  -  H  log  (2A(F,G))]*5  . 

Under  the  hypothesis  of  Theorem  1.4.1,  one  automatically  has  that 
“l/p  l/p'^’6^  -**•  By  the  conunents  prior  to  Proposition  1.2.4  in  Section  1.2, 
in  evaluating  a^p  j/pt(F,G)  one  only  needs  to  consider  events  A  and  B  with 
probability  s  \  .  Since  0  s  P(AnBp)  s  P(A),  one  has  that  if  P(A)  < \  then 

|P(AnB0)  -  P(A)P(BQ)|  s  (Ij)P(A)  S  Os)  [P(A) ]  1/p[P(BQ)  ] 1/p’ 

(where  B0  is  as  in  Theorem  1.4.1).  Consequently  a^p  ^.(F.G)  cannot  exceed 
*s  under  these  circumstances. 

The  proof  of  Theorem  1.4.1  will  be  given  at  the  end  of  this  section  and  will 
be  based  on  some  preliminary  lemmas. 
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We  start  from  the  purely  discrete  case,  when  F  (as  well  as  G  of  course)  is 
finitely  generated.  The  reasoning  is  close  to  that  in  [12].  We  use  the  extreme 
point  method  and  the  following  version  of  the  Krein-Milman  theorem: 

Theorem  A.  If  K  e  lRn  is  a  compact  convex  set,  then  for  each  xeK 

there  exist  extreme  points  x^,...,  xfi+1  of  K  and  non-negative  numbers 

A,,...,  X  .  such  that  I?+Jx.  *1  and  x  =  Z?+l'A.x.. 

1  n+1  i=l  i  i=I  i  i 

(Recall  that  xeK  is  called  an  extreme  point  of  K  if  x  cannot  be  represented 
as  a  non- trivial  convex  combination  of  points  of  K.) 

We  shall  be  interested  in  a  particular  compact  convex  set  of  probability 
measures  on  a  certain  measurable  space.  Let  a  e  [0 ,^5]  and  r,s  e  [0,1]  be 
fixed,  and  suppose  m  is  a  fixed  positive  even  integer.  Define  the  sets 
fij  :=  (1,2,...,  m}  and  =  fO,l).  The  measurable  space  will  be 

(accompanied  by  the  discrete  o-field).  For  ease  of  notation  we  shall  denote 
: =  { i } ,  i  =  1,2,...,  m,  and  BQ  {0},  Bj  :=  {l}.  Let  K  be  the  set  of 

all  probability  measures  on  Such  that 

(1.4.1)  P(At  xfip  =  l/m  for  all  i  =  l,...,  m, 

(1.4.2)  xBq)  =  P(fl1  xBj)  =  %,  and 

(1.4.3)  |P(AxB)  -  P(A  x  B)  |  s  a*  [P(A  x  fl)  ]r.  [p^  x  B)  ]S 

for  each  pair  of  subsets  A<={1,...,  m).  Be  {0,1}. 

Clearly  K  can  be  regarded  as  a  compact  subset  of  Fm_*.  Note  that  K  is 
convex  too.  If  P^,  P^  e  K  and  0<X<1,  then  the  probability  measure 
P  :=  (l-X)Pj+XP2  satisfies  eqns.  (1.4.1)  and  (1.4.2),  and  since  the  marginals 
of  Pj  and  ?2  (and  hence  also  P)  are  equal,  eqn.  (1.4.3)  is  also  satisfied  by  P, 
and  hence  P  e  K. 

We  shall  be  interested  in  the  o- fields 
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:=  (A*!!,:  A  c  { l ,  2 , . . . ,  m  }}  and 

In  t, 

(1.4.4) 

G  (&,  *B:  Be  {0,1}}. 

in  1 


(The  subscript  ra  serves  as  a  reminder  of  the  parameter  m  on  which  flj  *  fi2  is 
based.)  K  is  the  set  of  all  probability  measures  on  ^  * «2  with  uniform  mar¬ 
ginals  such  that  a  (F  ,G  )  s  a. 

&  r,sv  m’  nr 

Lemma  1.4.2.  Each  extreme  point  P  c  K  has  the  form 

P(Aa{.)  x  6^  =  1/ (2m)  +  F(i/m)  -  F((i  -  l)/m)  for  IS  is  m/2, 

P(Aa(i)  *  V  =  1/m  -  P(Ao(m+l-i)  *  B)  for  "l/2  +  lsiSl"' 
where  a  is  a  permutation  of  {1,2,...,  m}  and  F(x)  :=  min{x/2,  a*2“Sxr}. 


Sketch  of  proof.  Each 
A.  :=  P(A.  xBj)  -  1/ (2m) , 


numbers  satisfy 
m 


(1.4.5) 

I  A.  =  0  and 
i=l  1 

(1.4.6) 

|A.|  s  (2m)'1 

P  e  K  is 
i  *  1,2, . . . 


for  all  i 


uniquely  determined  by  the  numbers 
,  m.  By  eqns.  (1.4.1)  and  (1.4.2)  these 


—  1 , . . . ,  m . 


Since  the  property  of  being  an  extreme  point  of  K  is  not  affected  by  the 

order  of  numeration  of  events  A  ,A2,...,  Am,  to  prove  Lemma  1.4.2  we  may 

assume  without  loss  of  generality  that  A1  s A2  £  . . .  >  Am>  By  eqns.  (1.4.3) 

and  (1.4.6)  we  see  that  a  given  sequence  of  numbers  A,  > A.  £...>  A  corres- 

12  m 

ponds  to  a  probability  PeK  if  and  only  if  (1.4.5)  and  the  following  two 
equations  hold: 

k  .  , 

(1.4.7)  For  all  k  =  l,...,  m,  \  A.  s  min{^—  ,  a*(— )r*2~S}, 

i  m 

m  %  * 

(1.4.8)  For  all  k  *  1, . . . ,  m,  \  (-Ai)  s  minf^-,  a*(^)r*2~S}. 
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Using  this  information,  one  can  check  with  a  little  elementary  work  that  the 

unique  extreme  point  P  of  K  which  satisfies  A, £  A-  £  ...  ^A  is  the  one  in 

12  m 

which  equality  in  (1.4.7)  and  (1.4.8)  is  attained  for  all  k  =  l,...,  m/2,  i.e. 


A^  *  F(i/m)  -  F((i-l)/m)  for  all  ism/2, 

(1.4.9) 

A.  a  -A  .  for  all  i  £m/2  +  1, 
i  m+i-i 

where  F  is  as  in  the  statement  of  Lemma  1.4.2.  The  leirana  follows. 


Remark.  Referring  to  the  proof  of  Lemma  1.4.2,  in  the  case  where 

r  +  s*l,  the  extreme  point  PeK  given  by  eqn.  (1.4.9)  has  the  property  that 

for  A  =  {1,...,  m/2},  P(AnB1)  -  P(A) PCB^  =  a* [P(A) ]r[P(B1) ]s  and  hence 

(for  this  P)  a  (F  ,G  )  *  a.  (See  the  paragraph  preceding  Lemma  1.4.2.) 
r,s  mm 

The  same  holds  for  any  other  extreme  point  of  K  with  an  appropriate  modification 
of  the  event  A. 


Lemma  1.4.3.  Let  Aj,...,  Am  now  be  fixed  as  in  eqn.  (1.4.9). 
1  <  P»q  <  °°»  then  for  any  probability  PeK  we  have  that 


If 


R  (F  ,G  )  S  2(  [  | A. |p')1/p'm1/p 
p,qv  mm  v  L  '  l 1  ' 

i=l 


(where  F^  and  Gm  are  as  in  eqn.  (1.4.4)).  Moreover,  equality  is  attained  for 
any  extreme  point  P  of  K. 


Proof.  By  Theorem  A,  each  probability  PeK  can  be  represented  as  a 
finite  convex  combination  of  extreme  points  of  K,  i.e.  P  =  E  X  P  where 

OOO 

E0Xo  =1,  XQ  2  0,  and  the  Pa  are  the  extreme  points,  given  by  Lemma  1.4,2. 

Here  a  is  ranging  over  all  permutations  of  {1,2,...,  m}. 

Since  the  probability  measures  in  K  all  have  the  same  marginals,  we  have 

that  for  any  PeK  and  any  r.v.'s  X  and  Y  on  (1^  *  such  that  X  is  Fm- 

measurable  (i.e.  a  function  only  of  the  first  coordinate)  and  Y  is  G  -measura- 

m 

ble  (I.e.  a  function  only  of  the  second  coordinate). 
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1/XYdP  -  /XdP/YdP|  S  l  X0|/XYdP0-/XdP0/YdP0| 

O 

*  maxa|/XYdPo-/xdPa/YdPo| 

^  a.(/|X|pdPa)1/p*(/|Y|qdPo)1/q 
where  a  :  =  supx>y| /XYdP0  -  /xdPo/YdPol/(||x||p  •  || Y||q) 

(the  sup  being  taken  over  X  and  Y  as  above) .  Note  that  a  depends  on  m  but 
is  not  dependent  on  o. 

If  X  and  Y  are  as  above  and  we  add  a  constant  to  Y,  then 
|/XYdPQ  -  /xdPo/YdPo|  remains  unchanged;  but  || Y || ^  can  be  minimized  (under 
addition  of  a  constant)  by  centering  it  at  expectation.  This  latter  fact  fol¬ 
lows  from  the  nature  of  the  o- field  G  (i.e.  two  atoms,  each  with  probability 

ID 

Jj) .  Consequently, 

a  *  supx2|/XYdPa| 

where  Y  :=  I_  -  *s  and  the  sup  is  taken  over  all  F  -measurable  X  such  that 
b,  in 


From  the  representation  X  =  £™_jX^I^  we  see  that  a  is  the  maximum  of 

the  function  f(x,,...,  x  )  :=  2E®  .x.A.  under  the  restriction 
1  m  i=lii 

(I®  |  x4  |P) 1/,p  *  m1/,p.  It  is  well  known  that  this  maximum  is  2m1/p(£®_1 | Ai |p  )^p 

Lemma  1.4.3  (both  parts)  is  now  easy  to  see. 

Lemma  1.4,4.  Suppose  that  Osr,s£l  and  1  <p,q5«.  Suppose  that  F  and 

Q 

G  are  o-fields  on  some  probability  space,  where  G  has  two  atoms  B  and  B  ,  each 

having  probability  J*.  Then  there  exists  a  sequence  ,P^ ,P^ , . . •  of  probability 

measures,  where  for  each  (even)  m  the  measure  P_  is  defined  on 

m 

(1,2,...,  m}x{0,l)  and  has  uniform  marginals,  such  that 
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a  (F,G)  =  lim _ a  (F  ,G  ) 

r,sv  ’  nr*»  r,sK  m*  nr 

R  (F,G)  =  lim,..  R  (F  ,G  ) 
p,qv  m-**>  p,qv  m’  nr 

where  for  each  (even)  m  the  o-fields  F  and  G  are  as  in  eqn.  (1.4.4)  and 

mm 

ar  s^m*®m^  and  Rp  q^m’^np  are  as  de^ined  with  respect  to  the  probability 
measure  P  .  (In  the  two  limits,  m  is  restricted  to  even  integers.) 

Proof.  By  an  elementary  argument  based  on  eqn.  (0.1),  one  can  show  that 
it  suffices  to  prove  this  lemma  in  the  case  where  F  is  itself  a  finite  a- field. 
We  shall  henceforth  make  that  assumption  on  F.  Enlarging  the  given  probability 
space  if  necessary  we  may  assume  without  loss  of  generality  that  there  exists  a 
r.v.  1)  which  is  independent  of  FvG  and  has  a  continuous  distribution  function. 
By  an  elementary  argument. 


(1.4.10) 


a  (Fva(U)  ,G)  =  a  (F,G)  and 
r,s  r,s 

R  (Fvcr(U)  ,G)  =  R  (F,G) 
p.q  p.q  ’ 


(where  a(...)  denotes  the  o-field  generated  by  (...)). 

Let  Z  be  a  Fva(U) -measurable  r.v.  with  a  continuous  distribution  function 

Fz(»),  such  that  F  c  <y(Z) .  (For  example,  take  Z  =  z"_^I  ‘^(U)  where 

A(l) .  A(n)  are  the  atoms  of  F  and  f , , . . . ,  f  are  suitably  chosen  func- 

l  n 

tions.)  Then  as  a  consequence  of  eqn.  (1.4.10)  we  have  that 
a  (o(Z),G)  *  a  (F,G)  and 

R  (o(Z) ,G)  =  R  (F,G) . 

p.q  p.q 

Now  consider  the  probability  measure  P  on  [0,1]  x{0,l}  induced  by  the 
random  vector  (Fz(Z),Ig)  (where  B  is  one  of  the  atoms  of  G) .  Then  P  has 
uniform  marginals,  and  defining  the  a-fields  F*  and  G*  generated  by  the  first 
and  second  coordinates  of  this  new  probability  space,  we  have  that 


-  -  *  -  • 


-*  •"  •*  •*.  «*.  V 


«*«•*.  **• 


•  **•  ,*•  .*• 

,  *  .  "V-  •  .  •  O  V  N*  O  V  a1 


OT 
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“r.s^**6*3  3  ar,s(a(Z)  •<*)  -  ar>s(F,6)  and 
Rp>q(F*.G*)  -  Rp>q(o(Z),G)  =  Rp>q(F,G). 

For  each  even  m  let  F*  denote  the  sub-a-field  of  F*  generated  by  the  events 
{((i  -  l)/m,i/m)  x {o,l},  i  ■  1,2, . . . ,  m).  Then 


(1.4.11) 


lim  a  _(F*  G*)  =  a  (F*,G*)  =  a  (F.G)  and 
nr+°°  r,s  m  r,s  r,s 


lira _  R  (F*,G*)  =  R  (F*,G*)  =  R  (F,G) . 

nr*°°  p,qv  m’  p,qv  p,qv  J 


Now  if  for  each  even  m  we  define  the  probability  measure  on 

{1,2,...  ,  m}  x  {0,1}  by  Pm({i}  x  {j})  =  P(((i  -  i)/m,i/m)  x  {o,l}), 

j  =0,1,  then  it  is  easy  to  see  that  as  a  trivial  corollary  of  eqn.  (1.4.11) 

these  measures  P  satisfy  the  conclusion  of  Lemma  1.4.4. 
m 

Proof  of  Theorem  1.4.1.  To  prove  the  inequality  in  Theorem  1.4.1,  by 
Lemmas  1.4.3  and  1.4.4  It  suffices  to  notice  that  if  0<r<l,  s  =  l-r,  p  =  1/r 
and  hence  p'  = 1/s,  then 

uV«,2ml/P(  l  |Ailp,)1/P'  =  2a*  [1  -  (±)PV  log  (2a)  ] 1/p' 
i=l  p 

(where  A^,...,  are  as  in  eqn.  (1.4.9)). 

To  prove  the  second  part  of  Theorem  1.4.1  consider  the  probability  measure 
P  on  10,1)  x  {0 , 1 }  with  uniform  marginals,  such  that  for  every  Borel  subset  A 
of  [0,1]  (letting  =  { 1 }  as  above), 

P(AnB.)  =  /[(is)  ♦  F*  (x)  ]dx 

A 

where  F(x)  is  as  in  Lemma  1.4.2  for  0  s  x  s  *s  and  F(x)=F(l-x)  for  4  <  x  <  1 . 
By  an  argument  like  the  last  part  of  the  proof  of  Lemma  1.4.4  (and  taking  into 
account  the  Remark  prior  to  Lemma  1.4.3)  one  can  verify  that  if  0<r<l, 


.;.v,v  v,  •; 


( fvn  <rm. 


KT  ^ 


’  .  •  .  •  «  *  »  *.*  *  »  ■  •  *  »  •  ‘  »  *  • 
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SECTION  2:  THE  VECTOR  CASE 

In  Section  2  here  we  shall  examine  measures  of  dependence  which  are 
based  on  random  variables  taking  their  values  in  Banach  spaces,  i.e. 
"B-valued"  r.v.'s.  For  simplicity  we  consider  only  real  Banach  spaces. 

But  all  results  here  are  valid  for  complex  Banach  spaces  as  well.  In 
fact  every  complex  Banach  space  is  also  a  real  Banach  space  (by  restricting 
the  scalars),  and  a  simple  argument  will  show  that  the  results  for  the 
complex  case  follow  from  the  real  case. 

Much  of  the  theory  of  operators  on  spaces  of  functions  (including 
many  results  in  interpolation  theory)  has  been  extended  to  functions  taking 
their  values  in  Banach  spaces.  Most  of  the  results  here  in  Section  2  are 
entirely  elementary  and  in  many  cases  are  simply  a  reformulation,  in  our 
context,  of  results  already  known  in  functional  analysis  in  connection  with 
B-valued  functions.  Our  main  goal  here  is  to  clarify  the  connection  between 
measures  of  dependence  and  functional  analysis  involving  B-valued  functions. 

SECTION  2.1;  PRELIMINARIES 

Here  in  Section  2.1  we  shall  review,  in  our  terminology  involving 
measures  of  dependence,  some  elementary  aspects  of  functional  analysis 
involving  B-valued  functions. 

The  norm  of  an  element  x  of  a  given  Banach  space  B  will  be  denoted 

by  IMIB  • 

First  recall  that  in  the  special  case  of  a  real  Hilbert  space  H  with 
inner  product  <.,.>  ,  each  real  bounded  linear  functional  g  on  H  is  of  the 
form  g(x) :  =  <x,y>  Vx€H  for  some  fixed  y f  H  .  In  this  setting  it 
is  common  practice  to  identify  the  functional  g  with  the  element  y;  and 


with  this  identification  one  can  say  that  g(x)  =  <x,g>  .  To  put  this 
another  way,  H  is  (isometric  to)  its  own  (real)  dual  space. 

Suppose  B  is  a  Banach  space  (with  just  real  scalar  multiplication). 

If  *£  B  and  g  is  a  real  bounded  linear  functional  on  B,  then  we  shall 
use  the  notation  <x,g>  :=  g(x)  (by  analogy  with  the  Hilbert  space  context 
described  above).  The  (real)  dual  space  of  B,  i.e.  the  space  of  all  real 
bounded  linear  functionals  on  B,  will  be  denoted  by  B*.  Of  course  B*  is 
itself  a  Banach  space  (with  just  real  scalar  multiplication)  with  norm 
II-  llB*  given  by 

(2.1.1)  ||g||B*  :=  sup  I <x,g>  1  ,  xeB 

llX  II B 

=  sup  <X,g>  ,  X  £  B 

IHlB 

(The  latter  equality  is  trivial;  one  can  consider  -x  in  place  of  x.) 

If  x  eg  then  we  can  define  a  real  linear  functional  z  on  B*  by 
z(y)  :=  <x,y>VyeB*  .  That  is,  <y,z>  :=  <x,y>  Vy  £B*  .  It  is  easy 
to  see  that  z  is  a  bounded  functional,  i.e.  z  £ B**  (the  "second  dual"  of 
B) ,  with  norm  ||z||g**  =  l!xllg  •  The  identification  of  each  xeB  with  the 
corresponding  z  £ B**  is  the  standard  way  of  isometrically  embedding  B  in 
B**.  In  many  cases  B**  will  have  other  elements  besides  (the  embedding  of) 
the  ones  in  B. 

Recall  the  simple  equation 


(2.1.2) 


vy  eB*. 


z  £  B** 


llyll 


B* 


(or,  similarly,  VXf  B,  ||x||R  =  sup  |<x,y> |/ ||y|L*  , 


ye  B*)  .  The  following 
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simple  consequence  of  eqn.  (2.1.2)  will  be  useful  later  on: 

Lemma  2.1.1:  Suppose  B  is  a  Banach  space,  y  € B*  ,  z  e  B**  ,  and 

e>0  .  Then  there  exists  xeB  with  ||X||B  £  II 2 II  g**  such  that 

|<y,z>|  £  (1+e)  •  <x,y>  . 


Now  let  us  return  to  our  probability  space  (ft,M,P)  .  For  each 

a-field  A  CM  and  each  Banach  space  B,  define  S(A,B)  to  be  the  set  of  all 

A-measurable  simple  B-valued  random  variables,  that  is,  the  set  of  random 

N 

variables  of  the  form  x^  1^^  where  N  is  a  positive  integer, 

Xj,...,x^  are  elements  of  B,  and  {A^,...,A^}  is  a  partition  of  with 
c  A  Vi  .  For  each  X  CS(A, B)  and  each  p,  1  £  P  £  00  »  define 


llx 


11*11 


B  "p 


1/p 

f  [E(t|X||P)]  if  1  <  p  < 
ess  sup  ||X||  B  if  p  =  » 


N 

To  put  this  another  way,  if  X  *  xi  1^^  with  N,  Xj,...,x^,  Aj,...,^ 

as  above,  then  ||x||p  =  [Z^=1  II [| g  *  P(Ai)]1/^P  if  1  £  p  <  °°  ,  and 

l|X|L  =  max  {||x.||B  :  PCA^  >  0}  . 

For  each  p,q  with  1  £  p,q  £  00  and  each  Banach  space  B,  define  the 
measure  of  dependence  ^(B)  between  o-fields  F  and  G  as  follows: 


(F,G;B) 


sup 


1e<x,y>-<ex,ey> 

llXllp  •  llYHq 


X  eS(F,B) 

Y  eS(G,B*) 


Proposition  2.1.2:  Suppose  1  £  p,q  £  00  ,  and  B  is  a  Banach  space. 

Then  for  any  two  o-fields  F  and  G  the  following  three  statements  hold: 

(a)  For  any  Banach  space  Bj  which  is  isometric  to  B, 

Rn  n(F’G;Bi)  =  K  „CF.G;B)  . 

p.q  1  p.q 


(b)  For  any  subspace  of  B  (or  Banach  space  B^  which  is  isometric 
to  a  subspace  of  B),  Rp  ^(F,G;Bp  £  Rp  ^(F,G;B) 

(c)  Rp>q(F,G;B)  =  Tsup  Rp^F.GjBj) 

Finite-dimensional  subspaces  B^  of  B 

Proposition  2.1.2  is  elementary  and  its  proof  is  left  to  the  reader. 
The  Hahn-Banach  theorem  will  be  helpful  in  proving  part  (b) . 


Proposition  2.1.3:  If  1  £  p,q  £  00  and  B  is  a  Banach  space,  then 
for  any  two  o-fields  F  and  G,  R  (F,G;B*)  =  R  (G, F;B)  . 

p>  q  q  >  p 

Proof:  We  shall  first  prove  the  following  lemma: 

Lemma:  If  e  >  o  ,  Yf  5(F,B*)  ,  and  ZeS(G,B**)  ,  then  there 

exists  XeS(G.B)  such  that  ||X||q  £  ||Z||q  and  |E<Y-EY,Z>|  <  (1+e)  E<X,Y-EY> 

Proof  of  Lemma:  Represent  Y-EY  and  Z  by  Y-EY  =  E?  y.  I  ...  and 
r"  1-1  1  ZV^lJ 

z  *  ^j  =  1  Zj  IB(j)  where  yieB*Vi  ,  {Aj^.^Aj}  is  a  partition  of  ft 

with  A.,«Fvi  ,  Zj  c  B**  Vj  ,  and  {Bj,...,Bj}  is  a  partition  of  ft 

with  B.  e  G  Vj  . 

J  J 

Using  Lemma  2.1.1,  for  each  j=l,...,J  let  x^cB  be  such  that 

llxj  llB  i  Hzj  llB**  and 


<  E?  .  P(A.nB.)y.  ,  z .  >  1  <  (l+e)<x.  ,  E^  ,  P(A.nB.)y.> 
i=l  i  j  i  J  1  —  j  i=l  i  j  i 


X  :=  E. 
J 

=1  Xj  JB(j) 

.  Then  |fx(u>)  ||B 

hence 

v  I 

cr 

X 

INI 

.  Also, 

q 

If 

M 

II 

II 

►H  -H 

V 

H 

P(A. 

nBj)yi’  zj>! 

A 

M 

M- 

II 

>-* 

P(A. 

nB.)y.,  z.>| 

» 

(1+e)  • 

<Xi* 

E1  ,  P(A.  n  B . )y . > 
1=1  1  J  1 

(1+e)  E<X,Y-EY> 
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This  completes  the  proof  of  the  lemma. 

By  the  lemma,  for  any  Ye  S(F,B*)  ,  ZeS(G,B**)  ,  and  e  >  0, 


E<Y,Z>  -  <EY,EZ> 1  _  Ie<Y-EY,Z> 

11^ llp  Mq  =  IM!p  llzllq 


<  (l+e) 


sup  E<X, Y-EY>  ,  Xe  S(G,B) 

IDcllqllvIlp 


=  (1+e) 


sup  E<X,Y>  -  <EX,EY>  ,  X  e  S(G,B) 

11*11,  imip 


<  (l+e)  •  R 

4*  P 


Hence  R  (F,G;B*)  <  R  (G, F;B)  . 

p.q  -  q»p 


By  an  exactly  analogous  argument,  R  (G,F;B**)  £  R  (F,G;B*) 

q»p  p»q 

By  Proposition  2.1.2(b)  and  the  fact  (noted  earlier)  that  B  is  isometric 
to  a  subspace  of  B**,  R  (G,F;B)  <  R  (G,F;B**)  .  From  these  last  three 

q.p  —  q»p 

inequalities  Proposition  2.1.3  follows.  This  completes  the  proof. 

From  two  applications  of  Proposition  2.1.3  (or  from  its  proof)  we  have 


Corollary  2.1.4;  If  1  £  Pi  q  5  00  and  B  is  a  Banach  space,  then 


R  (B**)  =  R  (B) 
p,q  p»q 


1  oo 

Let  £  (resp.  I  )  denote  the  Banach  space  of  all  absolutely  summable 
(resp.  bounded)  sequences  x  :=  (ClfC2* £3* • • • )  of  real  numbers,  with  norm 


INI  !  :=  £°°b1  KJ  (resp.  ||x||  ^  :=  sup.  UJ  )  . 

^  Z 

1  00 

It  is  well  known  that  (£  )*  is  isometric  to  Z  ,  the  standard  isometry  being 
(informally  described)  as  follows:  If  x  :=  (f^ , . . . )  e  d1  and 

OO 

, . . .)  el  then 


y  :=  (HJ.H2 
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(2.1.3)  y(x)  =  <x,y>  :=  z“=J 


Theorem  2.1.5:  If  1  _<  p,q  <  00  and  B  is  a  non- trivial  Banach  space. 


then  R  (R)  <  Rn  (B)  <  R  U°°)  =  R  U1)  . 

p.q  -  p.q  -  p.q  p»q 


Proof:  The  first  inequality  follows  from  Proposition  2. 1.2(a) (b), 

since  R  is  (isometric  to)  a  subspace  of  B.  The  second  follows  similarly  if 

B  is  finite  dimensional,  since  every  separable  Banach  space  can  be  isometrically 
00 

embedded  in  l  (as  is  well  known);  the  general  case  follows  by  Proposition 
2.1.2(c). 

Finally,  by  the  second  inequality  and  Proposition  2.1.3,  we  have  that 
for  any  two  a-fields  F  and  G  CM  , 


R  (F,G;  J,°° )  =  R  (G.FjS,1)  <  R  (G,F;jT)  =  R  (F,G;  Jt1)  <  R  (F,G;  2°°) 

p.q  q.p  -  q.p  p.q  -  p.q 


This  implies  the  final  equality  in  Theorem  2.1.5.  This  completes  the  proof. 


For  the  next  theorem,  recall  eqn.  (0.4). 

Theorem  2.1.6:  For  every  non-trivial  Banach  space  B  the  following  three 
statements  hold: 

(a)  R.  (B)  =  R  (R)  yq.  1  5  q  1  00  • 

A  9  4  1  9  4 

(b)  Rp  l(B)  =  Rp>1(R)  VP.  1  i  P  1  00  • 

(c)  In  particular,  R  (B)  =  R  (R)  =  2$  . 

*  9  *  9  * 

Proof :  Let  us  take  (a)  for  granted  for  a  moment,  and  look  at  (b)  and 
(c) .  From  (a)  and  Proposition  2.1.3  (and  the  trivial  fact  that  R*  is 
isometric  to  R)  one  can  easily  derive  (b).  The  first  equality  in  (c)  is 
a  special  case  of  (a).  The  second  equality  in  (c)  is  well  known:  The 


inequality  Rj  ^(F.GjR)  <  2$(F,G)  follows  e.g.  from  [10,  pp.  11-12, 

Lemma  1.1.9],  and  the  opposite  inequality  follows  from  a  simple  argument 
based  on  the  identity 

E[IAaB-I  )]  -  EIAE(VI  )  =  2-  [P(B|A)-P(B)1  •  •  ||l„-I  ||„ 

B  B  B 

(say  with  A  e  F  ,  B  eG  ,  P(A)  >  0  )  .  Now  all  that  we  need  to  do  is 

prove  (a) . 

It  suffices  to  prove  that  Rj  ^(F, 6; B)  =  Rj  ^(F,G;R)  in  the  case  where 
1  £  q  £  00  is  arbitrary  but  fixed  and  F  and  G  are  finite  a-fields,  say 
with  F  generated  by  the  partition  {Aj,...,A-}  of  ft,  and  G  generated  by 
the  partition  {Bj,...,Bj}  of  ft  .  Without  loss  of  generality  we  assume 
that  P(Bj)  >  0  v j 

Suppose  X  and  Y  eS(G,B*)  .  It  suffices  to  prove  that 

(2.1.4)  |E<X,Y>  -  <EX,EY>1  <  R.  (F,G;R)  •  \\X\\.  •  ||Y|I  . 

i.,q  i  q 

Represent  X  and  Yby  X  :=  Z?  ,  x.  I.,..  and  Y  :*  E1!  ,  y.  I.,,,  , 
r  7  1=1  l  A(i)  j*l  7j  B(j) 

with  x.e  B  and  y^ e  B*  ,  with  the  events  A^, . . . ,Aj,Bj, . . . ,Bj  being  as 

above.  Then 

|E<X,Y>  -  <EX,EY> |  = 

*  *Si=l  Zj=l  <Xi*yj>  [PCAin  Sj)  -  P(A.)P(B.)]| 

i  ii  llxillB  *  II/jIIb*  *  lptAilB^  -  pc\)l  *  p(Bj) 

1  [^j=1  IIXjllJ*  *  P(B.)]1/q 

*[zj=i  (Ei«i  iixillB  *  lp(AJV  '  p(Vl)q,pcBj)]1/q' 
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*  INI,  •  ll£i«i  ll-ill,  •  II,. 

i  INI,  •  £1.1  II XA  I! B  •  ll«IA(1)  |6)  -  EIMi)  llq, 

i  11*11,  •  £U  H^IIb  •  Ri,,tF-c-w  •  11^(1,11, 

-  R1>q(F,G;K)  •  ||Z||t  •  ||Y||q 

by  Holder's  inequality,  Minkowski's  inequality,  and  Proposition  0,l(ii). 

Thus  eqn.  (2.1.4)  holds.  This  completes  the  proof. 

Remark  2.1.7;  We  finish  Section  2.1  with  the  following  three  comments: 

(a)  From  Theorem  2.1.5,  for  given  p  and  q  with  1  £  p,q  £  00  ,  one  can 

interpret  R  (4°°)  (*  R  (41))  as  R  ("Banach  space"),  i.e.  sup-  R  (B) 
p,q  p»  q  p,q  rB  p,q 

(b)  For  given  p  and  q  (with  1  £  p,q  <  °°)  and  Banach  space  B,  one  has 
an  equality  analogous  to  Proposition  0.1,  namely 

R  (F,G;B)  =»  sup  HE(X'6)  '  EXilq«  ,  XeS(F.B) 

P»n 

H*llp 

(For  XcS(F,B)  ,  the  definition  of  the  q'-norm  of  the  not-necessarily- 
simple  B-valued  function  E(X|G)-EX  will  be  straightforward.) 

(c)  Interpolation  theory  applies  nicely  to  B-valued  functions;  see 

[1,  Chapter  5].  In  (3,  Remark  4.1]  it  was  noted  that  by  the  Riesz  (or 
Riesz-Thorin)  interpolation  theorem,  there  are  only  four  equivalence  classes 
for  the  measures  of  dependence  R^  q  in  the  lower  triangle  1/p  *  1/q  £  1  , 

represented  respectively  by,  say,  R^,  «,  *  »  »  R®  \  »  and  R2  2  *  Similarly, 

by  corresponding  interpolation  theorems  for  B-valued  functions,  for  any 

given  Banach  space  B  there  are  at  most  four  equivalence  classes  for  the 

measures  of  dependence  R^  q(B)  in  the  lower  triangle  1/p  +  1/q  <  1  , 


represented  by,  say,  ^(B)  ,  ^(B)  ,  R^CB)  ,  and  R2>2(B)  *  By 


Theorem  2.1.6,  R,  (B)  and  Rm  ,  (B)  do  not  depend  on  the  choice  of  (non- trivial) 

00 1  1 

B;  but  as  we  shall  see  in  the  next  section,  R^  ^(B)  can  significantly  vary 
with  B. 

SECTION  2.2:  ABSOLUTE  REGULARITY 

The  measure  of  dependence  0  in  eqn.  (0.7)  is  the  basis  for  the  well 

known  "absolute  regularity"  [17]  condition  for  stochastic  processes.  It  is 

already  known  that  0  is  not  equivalent  to  any  of  the  other  measures  of 

dependence  given  in  eqns.  (0.3) -(0.6)  and  (0.8),  or  to  any  of  the  measures  of 

dependence  o  (0<r,s<l)  or  R  (l<p,q<«°)in  eqns.  (0.10) -(0.11) 
r,  s  p»  q  “  ~ 

at  all.  (More  on  this  shortly.)  Also,  it  seems  that  many  limit  theorems 

for  real-valued  (or  even  H-valued)  r.v.'s  under  the  "strong  mixing"  condition 

(which  is  based  on  the  measure  of  dependence  a  in  eqn.  (0.3))  cannot  be 

carried  over  to  more  general  B-valued  r.v.'s  unless  the  assumption  of 

strong  mixing  is  replaced  by  the  (stronger)  assumption  of  absolute  regularity. 

See  e.g.  the  discussion  and  results  in  Dehling  and  Philipp  [5].  To  some 

extent,  absolute  regularity  appears  to  be  the  most  natural  mixing  condition 

for  limit  theory  for  B-valued  r.v.'s.  Indeed,  the  measure  of  dependence  0 

has  a  quite  "natural"  interpretation  in  terms  of  the  measures  of  dependence 

R  (B) ;  in  Theorem  2.2.1  below,  it  will  be  shown  with  a  totally  elementary 
P»q 

argument  that  0  *  (1/2)  •  R  (1°)  (■  (1/2)  •  Rm  (£*))  . 

»  » 

Now  let  us  clarify  the  comparisons  between  0  and  the  measures  of 

dependence  a  ,  R  in  eqns.  (0. 10)- (0. 11) .  Recall  the  inequalities 
*  P»S 

a  1  3  1  ♦  (see  eqns.  (0.3)-(0.4)) .  Also,  0  does  not  dominate  p  (see 
eqn.  (0.6)).  (For  a  o-field  F={fl,A,Ac,$}  with  P (A)  *  e  (say  with  e 
small),  one  has  0(F,F)  =  2e(l-e)  and  p(F,F)  =  1  .)  From  these  facts  and 
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the  remarks  in  Section  1.2,  one  has  that  6  dominates  a  only  for 

r»  s 

r  +  s  <  1  ,  that  6  dominates  R^  ^  only  for  1/p  ♦  1/q  <  1  ,  and  that  $ 

is  dominated  by  a  if  either  r=l  or  s  *  1  ,  and  hence  by  R  if 
r,s  7  p,q 

either  p*l  or  q  =  1  .  In  Theorem  2.2.3  below  we  shall  show  that  3 
is  not  dominated  by  a  if  r  <  1  and  s  <  1  ,  or  even  by  R  if  p  >  1 

T|  S  P»<J 

and  q  >  1  . 


Theorem  2.2.1:  R„  _(8°°)  =  R_  _(8*)  =  26. 

Proof:  By  Theorem  2.1.5  it  suffices  to  prove  that  R _ (8,*)  *  26. 

______  °°>  0° 

Since  these  measures  of  dependence  automatically  satisfy  eqn.  (0.1),  it 
suffices  to  show  that  if  F  and  6  are  finite  o- fields  then 

R«oJF»G;4*)  ■ 2  *  e(F.G)  • 

9 

Suppose  F  is  a  finite  o-field,  generated  by  a  partition  {Aj,...,Aj} 
of  ft,  and  G  is  a  finite  o-field,  generated  by  a  partition  {B1,...,BJ}  of  ft. 

Without  loss  of  generality  we  assume  that  P(Ai)  >  0  Vi  and  P(B.)  >0  Vj  . 

1  00 

Recall  that  (8  )*  is  isometric  to  8  *  we  shall  use  the  natural 

identification  of  these  two  spaces  described  in  eqn.  (2.1.3). 

1  00 

If  X  cS(F,8  )  and  Y  €S(G,8  )  then  one  can  represent  these  r.v.'s  by 
X  :  =  xA  IA^j  and  Y  :■  E*jaj  y^  (with  each  ej,1  and  each 

y.ef  )  ,  and  one  has 

|E<X,Y>  -  <EX,EY> J  « 

-  |E?  ,  rj  ,  <x. ,y.>  (P(A.n  B.)  -  P(A. )P(B .)) I 
1  1=1  j=l  i"j  v  l  j  l  y'' 

I  J 

<  (sup  llxjl  j)  •  (sup  ||y  II  J  •  E  E  |P(A  "  B  )  -  P(A.)P(B  )  | 
i  8  j  J  8  i=l  j=l  J  J 

•  l|X|L*  II Y  •  26(F,G)  . 


^  m  «  ,  »  .  *  ,  ■  i  *  •  *  «  •  «  »  •  •  ■.  *  •  ",  •'«'»•%  *  ry  ^  •  •  •  •,*  »tr»  •  .  '  .  *  . 


(2.2.1) 


Hence  R  (F,G;Jl  )  £  2 0(F,G)  .  Conversely,  in  eqn.  (2.2.1)  equality  is 
00>°° 

achieved  if  we  take 

xi  :=  (0, ... ,0, 1,0,0, .. .)  (where  the  1  is  the  ith  coordinate), 

Yj  • “  (Yj j>yj2*Yj3» • ••)  where 

Yjk  :=  (sign  of  [P(Ak  nB^)  -  P(Ak)P(Bj ) ]  if  1  <  k  <  I 
0  if  k  >  I 

Hence  oo(F,G;H^)  =  28(F,G)  .  This  completes  the  proof. 


Corollary  2.2.2:  For  every  Banach  space  B,  R^  ^(B)  <_  23  . 


This  just  follows  from  Theorems  2.1.5  and  2.2.1 


Theorem  2.2.3:  Suppose  that  l<p,q  £  00  .  Then  (on  some  probability 

space)  the  following  two  statements  hold: 

(i)  R  fails  to  dominate  3  • 

p.q 

(ii)  For  each  pj  and  qj  such  that  1  i  Pj^i  1  °°  >  Rp  ^  fails  to  dominate 

R  .  U°°)  . 

prqi 

Here  (ii)  is  a  trivial  corollary  if  (i)  and  Theorem  2.2.1.  To  prove 
(i)  we  shall  show  in  Example  2.2.4  below  that  for  different  choices  of  pairs 
of  o-fields  F  and  G,  on  different  choices  of  probability  spaces,  one  can 
(simultaneously)  have  8(F,G)  »  1/2  and  R  (F,G;R)  arbitrarily  small. 

p*  q 

Then  by  eqn.  (0.13)  and  Remark  0.4,  (i)  will  follow. 

Example  2.2.4:  Let  p  and  q  be  arbitrary  but  fixed,  such  that  1 <  p,q  < 
We  shall  show  that  Rp  ^(F,G;R)  can  be  arbitrarily  small  for  o-fields  F 
and  G  satisfying  B(F,G)  =  1/2  .  Now  R  is  non- increasing  as  p  and/or  q 

p*  q 

increases;  without  loss  of  generality  we  impose  the  following  further 
restrictions  on  p  and  q: 


v.v.\v /-v-v-v-* .•*\'V*v-Vv’wV-v \'--v  v  •  •-v-  SI-v 

9  »  *  *  "  4  %  4  **  4,4^4,1l<  4  n  *  •  *  •  '  4  ^  4  #  »  *  •  .*4  .  1  a  '  *  i  *  •  4  T  4  %  4  *  4  "  ,***4*4* 
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(2.2.2) 

1  <  p  <  2 

(2.2.3) 

1  <  q  <  * 

Thus  q1 

satisfies  1  <  q»  <  00 

The  following  elementary  fact  will  be  needed  later  on: 


(2.2.4)  s«,  |ak|p) 


1/P  >  C  £  , 

—  k*l 


W2>1/2 


(Let  us  quickly  review  this.  Recall  the  elementary  inequality  (a+b)*  ^  a*  +  b* 
for  a  >  0,  b  0,  t  1  .  By  induction,  (£™=1  c^)*  i  c£  f°r  t  i  1 
and  non-negative  Cj,...,c  .  Now  plug  in  c^  :■  lajclP  and  1  :=  2/p  » 

using  eqn.  (2.2.2).) 

Now  let  n  be  an  arbitrary  but  fixed  positive  integer.  Construct  the 
probability  space  (£2,M,P)  as  follows: 

Define  £2  :«  {1,2, ...,2n}  x  [0,1]  .  Define  M=  the  family  of  Borel 

subsets  of  £2.  Define  the  probability  measure  Pj  on  {l,2,...,2n}  (with  the 
standard  discrete  o-field)  by  Pj({j})  *  (2n)_1  ,  j  =  l,2,...,2n  .  Define 

the  probability  measure  P2  on  [0,1]  (with  the  Borel  o-field)  to  be  Lebesgue 
measure.  Finally,  define  the  probability  measure  P  on  (£2,M)  such  that  P  is 
absolutely  continuous  with  respect  to  Pj  x  P2  »  such  that 


(2.2.5)  — — -  (j.x) 

(KPjXP^ 


1  ♦  hj  (x) 

if  1  £  j  <  n 

■  - 

if  n  ♦  1  <  j  <  2n 

where  hj,  h2,  ...  are  the  Rademacher  functions  on  [0,1]. 

Note  that  the  marginals  of  P  on  {l,2,...,2n}  and  [0,1]  are  Pj  and  P2 
respectively. 


Let  us  define  the  "marginal"  a-fields 
F  :»  {Ax  [0,1]:  A  is  a  subset  of  {1,2, . . . ,  2n}} 

6  :*  {{l,2,...,2n)xB:  B  is  a  Borel  subset  of  [0,1]} 

By  Proposition  0.1, 

||E(f  |G)  -  Ef  || 

(2.2.6)  R  (F,G;R)  «  sup _ i_  ,  feS(F,R) 

imiP 

Let  f  be  an  arbitrary  but  fixed  (simple)  real-valued,  F-measurable  r.v. 

We  shall  first  obtain  an  upper  bound  on  ||E(f|G)  -  Ef  || ^,  /|jf||p 

This  function  f  *  f(j,x)  depends  only  on  j;  we  can  use  the  notation 
f(j]  :■  f(j,x)  Vj  *  1, 2, . . . ,  2n  (x«[0,l])  . 

Define  the  functions^  and  f^  on  {l,2,...,2n}  as  follows: 

(2.2.7)  fj(j)  :»  -fjfj+n)  :■  (f(j)  -  f(j+n))/2  Vj  *  1,2, ...,n 

(2.2.8)  f2(j)  :=  f2(j+n)  (f(j)  *  f(j+n))/2  vj  =  1,2, ...,n 

Then  vj  *  1,2, ...,2n  ,  f(j)  *  fj(j)  +  f 2C j )  . 

For  a.e.  x  e[0, 1]  (and  every  JU  {1,2, ...,2n}  ),  we  have  by  eqn.  (2.2.5) 
and  some  simple  calculations,  E(f|G)(£,x)  -  Ef  *  fj(j)  *h^.(x)/n  . 

Therefore 

||E(f  |G)  -  Ef  ||  q  *  *  (1/n)  •  [/J  |  f^j)  h.  (x) |q'dx] 1/q' 

By  eqn.  (2.2.7)  and  the  convexity  of  the  function  u^  (for  nonnegative 
u),  we  have  that  IJfjHp  <  ||f  ||p  .  Also  ||fj  ||p  -[(1/n)  •  |  fj(j)  |P] 1/p 

by  a  trivial  calculation.  Hence  by  eqn.  (2.2.4)  and  Khinchin's  inequality 
(see  e.g.  [20,  Chapter  5,  Theorem  (8.4)])  , 
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||E(f  |6)  -  Ef  ||a,  n(1/pM  [/l  |t"_i  f^j)  h.(»)|C|'dx]1/‘1' 

Mlp  ^  [  ij.!  l*,(j)|P]I/P 

n(l/p)-l  c(ql)  .  [£n^  tf1Cj)J2]1/2 

1  I  f1(i)l2l1/2 

*  n‘1/p'  •  C(q') 

where  the  constant  C(q')  depends  only  on  (our  fixed  value  of)  q'. 

Since  f  was  arbitrary,  we  have  by  eqn.  (2.2.6), 

R  AF  6;  R)  <  n‘1/p'  *C(q') 

p,q 

By  making  n  sufficiently  large  to  begin  with,  we  can  make  R  (F,G;R) 

P»<1 

arbitrarily  small.  Also,  by  eqn.  (2.2.5)  and  an  elementary  argument, 
regardless  of  the  value  of  n, 

0(F,G)  *  \  •  U  | - - —  -  1|  d(P.xP  ) 

1  W  d(P1xP2)  1  1 

-I./fild(piXP2) 

1 

X  - 

2 

This  completes  the  discussion  for  Example  2.2.4,  and  thereby  completes  the 
proof  of  Theorem  2.2.3. 

Remark  2.2.5:  Gastwirth  and  Rubin  [7]  defined  for  each  t,  1  <  t  <  ®  , 
the  following  measure  of  dependence: 

At(F,G)  :=  ||var[PG(.|  F)  -  pG(.)]||t 

where  ?r  is  the  restriction  of  the  measure  P  to  events  in  G.  For  finite 


o-fields  F  and  G  the  definition  of  A^CF.G)  is  clear;  and  for  general  o-fields 
F  and  G,  where  a  measure-theoretic  ambiguity  may  have  to  be  settled,  we  shall 
simply  stipulate  that  eqn.  (0.1)  be  satisfied  by  At.  It  is  easy  to  see  that 
A,  *28,  A  *2$,  and  that  A„  is  equivalent  to  6  if  l<t<«.  By  a  simple 
argument  similar  to  that  of  Theorems  2.1.6  and  2.2.1,  one  has  that  for  each 
t,  1  sts®, 


A 


t 


SECTION  2.3:  THE  HILBERT  SPACE  CASE 


In  [3,  Section  4.3]  measures  of  dependence  based  on  H- valued  (Hilbert  space 

valued)  random  variables  were  examined.  Here  we  shall  extend  the  main  result 

of  that  section.  As  earlier  it  suffices  to  consider  real  spaces. 

Let  H  be  an  arbitrary  real  Hilbert  space,  with  inner  product  <*,•>. 

For  each  p,  1  sp<“,  the  quantity  Z  :*  [ (2tt) _j x|Pexp(-x2/2)dx]  1/,p 

P 

(the  p-norm  of  a  standard  N(0,1)  r.v.)  will  be  useful. 


Theorem  2.3.1.  If  H  is  a  non-trivial  Hilbert  space,  (p,q)  e  [  1  ,«*>]  ,  and 


F,GcM,  then  R  (F,G;H)sA*R  (F,G)  where  A  =  A(p,q)  is  a  function  only 

P»4 

of  p  and  q.  Further,  if  1  £p,q<®  then  one  can  take  A(p,q)  =Z  -Z  for 

pi  qi 

any  choice  of  pj  and  q1  satisfying  pSp^*:®,  and 

p^+q^Sl;  and  in  particular,  if  lSp,q<2  then  R^  ^(F,G;H)  =  R^  ^(F,G;]R). 


2 

Thus  for  a  given  (p,q)  e  [  1 ,°°]  the  measure  of  dependence  R  (H)  is 

p  >q 

linearly  equivalent  to  R  (F)  (or  to  R  in  eqn.  (0.10)).  For  the  case 

p.q  p.q 

p=q=oo,  this  was  shown  by  Dehling  and  Philipp  [5,  p.  692,  Lemma  2.2],  who 
used  the  Grothendieck  inequality  in  their  proof.  Under  the  restriction 
p“1+q'1sl  one  has  Theorem  2.3.1  from  [3,  Theorem  4.2];  the  purpose  here  is 


to  remove  that  extra  restriction.  Theorem  2.3.1  is  not  a  particularly  new 


result,  but  rather  an  adaptation  to  our  context  of  results  well  known  in 
functional  analysis. 


The  very  last  case  of  Theorem  2.3.1  (lsp,q<2)  follows  from  the  rest  of 

that  theorem  by  taking  Pj»q  =2.  (See  eqn.  (2.3.1)  below.) 

The  proof  of  Theorem  2.3.1  will  be  based  on  the  following  lemma: 

Lemma  2.3.2.  Suppose  1  s  p  s  Pj  £  1  £  q  s  qJ  <  °°,  and  p'1  +  q"1  £  1 .  Sup¬ 

pose  Xj,...,  Xj  and  Y^,...,  Yj  are  real-valued  (or  complex- valued)  random 
variables.  Then 


Proof. 


*  II  II  (Vi.lV^-ll  II  ‘Vj-1  V'q, 


Here  the  first  inequality  comes  from  Holder's  inequality  (and  the  fact  that 
our  measure  is  a  probability  measure) .  To  show  the  second  inequality  it  is 
enough  to  show  that 


II  II  (x.) 


I 

i=l 


II  ctlx.ll  ) 

1  pi 


i 

i=l 


If  Pj  s0°  then  this  is  simple.  (Also,  if  p  =  l  then  this  is  simply  Minkowski's 


inequality.)  If  Pj  <°°  then 


IIIKvl.ill  pH,  ■ l«I  l*ilp)Pl/p]1/Pl 

1  1  P1  i-1  1 


[H  j1IXilPllp}/P1‘/Pl  ’  1  "  1I,'XilPl'  P,/P 


|1/P 
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*  1 1  ii  txiipnD  /pj1/p  - 1 1  [E|x.r>v>Y/'’i]1/p 

i-1  1  pl/p  i=l 


£p 


This  completes  the  proof  of  Lemma  2.3.2. 

Proof  of  Theorem  2.3.1.  For  the  cases  p=®  or  q=°°,  see  [3,  Theorem 
4.2J.  Here  we  only  consider  the  case  where  1  sp<<®  and  1  i  q  <®.  As  in  the 
proof  of  [3,  Theorem  4.2]  we  restrict  our  attention  to  an  arbitrary  finite¬ 
dimensional  real  Hilbert  space  H;  we  use  the  same  Gaussian  probability 
measure  y  on  H  as  was  used  there.  (The  use  of  a  Gaussian  measure  here  is 
similar  to  its  use  in  Rietz  114].) 

Suppose  X  :=  anc*  Y  :=  ^j-l^^Cj)  where  (Aj,...,  Aj.} 

and  {Bj,. . . ,  Bj}  are  each  a  partition  of  ft  with  e  F  for  all  i  and 
Bj€G  for  all  j,  and  Xj,...,  Xj  and  y^...,  yj  are  elements  of  H.  To 
prove  Theorem  2.3.1  it  suffices  to  prove  that 

(2.3.1)  | E<X, Y>  -  <EX,EY>|  S  R  (F,G;1R)  •  Z  •  Z  *||X|L  *||Y|L 


P,q 


Pi  ^i 


holds  for  every  choice  of  Pj  and  qj  meeting  the  specifications  psp^®, 
qSq1<»,  and  p^+q^Sl. 

Before  we  prove  eqn.  (2.3.1),  a  couple  of  preliminary  observations  will 
be  needed.  For  any  x.y  e H  one  has  <x,y>  =  /H<x,u>*<y ,u>y(du) .  (This  was 
used  in  the  proof  of  [3,  Theorem  4.2].)  For  any  xeH  the  r.v.  V:H->-]R 
defined  by  V(u)  :*>  <x,u>  is  a  N(0,  ||  x||H )  r.v.  (on  the  probability  space 
(H,y)),  and  hence  for  any  t,  1  st  <®,  /Hl<x,u>|ty(du)  =  ||x||*  •  Z*. 

These  observations,  Fubini's  theorem,  and  Lemma  2.3.2  (on  the  probability 
space  (H,y))  will  now  be  used  to  prove  eqn.  (2.3.1),  under  the  given 


49 


specifications  on  p^  and  q^,  as  follows: 

|n<X,Y>  -  <EX,EY> | 

»  |/E(<X,u>»<Y,u>)y(du)  -  /<EX,u>*<EY,u>Y(du) | 

H  H 

=  |/ [E(<X,u>»<Y,u>)  -  E<X,u>*E<Y,u>]Y(du) | 

H 

S  /R  n(F,G;F>[E|<X,u>|P]1/p*[E|<Y>u>|q]1/qY(du) 


H 


=  R 


p.q 


■p>q(F  ,G;  F)  /||  (<x.,u>-P1/p(A.))J  =  1||^p  •||(<yj,u>*P1/qCB.))j=1| 

s  R  (F ,G; R)  •||((/l<xi,u>*P1/p(A  )|PlY(du))1/Pl)J  |I 

p.q  H  1  1  1-1  & 

•||((/l<yi»u>-P1/q(Bi)|qiY(du))1/ql)'J  || 

H  1  J  J-A  £q 

-  R  (F,G;K)  •H(P1/P(Ai)*(/|<xi,u>|PlY(du))1/Pl){  II 

p.q  1  h  1-1  ^p 

•||(P1/q(B  )  •(/!</  ,u>|qiYCdu))1/ql)^  II 
1  H  J  3-1  £q 

•  Rp  q(F,G;lR)  •[  IiP(Ai)-(||x.||Pl-ZPl)P/Pl]1/p 

•t  l  P(B.) «(||y. |||J1*zqi)q/qi]1/q 
j=i  j  j  n  qi 

■  ».  „CF.G;K)  •!  •!  •||XIL,IIYII 


Pj  qj  '  p  q 

Thus  eqn.  (2.3.1)  holds.  This  completes  the  proof  of  Theorem  2.3.1. 
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ADDENDUM. 


In  connection  with  Theorem  1.1.1  and  13,  Theorem  3.6],  here  are  some 
extra  comments,  added  when  this  report  was  in  the  final  stages  of  its  prepa¬ 
ration.  The  multilinear  form  B  in  Theorem  1.1.1  automatically  has  the  property 
that  is  a  "product  form"  (in  the  terminology  used  in  [3,  Theorem  3.6]).  For 
any  given  £,  for  t>0  sufficiently  small  one  can  refine  Theorem  1.1.1  by  having 
d^(B)  =t,  with  B  itself  being  a  product  form,  (without  disturbing  the  other 
properties  in  Theorem  1.1.1).  (The  positive  constant  C=C(£)  may  have  to  be 
made  smaller,  but  that  is  of  no  significance.)  To  carry  out  this  refinement. 


we  can  restrict  our  attention  to  the  case  where  l<Pjt<“  for  some  k,  as  in 
the  proof  of  Theorem  1.1.1.  By  examining  the  bottom  four  lines  of  p.  12  (and 


the  two  top  lines  of  p.  13)  in  the  case  where  Bk  =  (0, 5j]  for  all  k,  we  see 
that  d^(B)s2~n+1t  in  that  construction.  For  t>0  sufficiently  small 
(depending  on  ]>)  we  can  start  by  simply  using  that  construction  with  t  replaced 
by  2nt.  Then  2t  s d^(B)  S2nt.  To  then  modify  the  construction  so  that 
d^(B)  =t  and  B  is  a  product  form  (without  disturbing  the  other  properties  in 
Theorem  1.1.1)  we  simply  define  the  number  a  :=  t/d^(B)  and  then  replace 
the  probability  measure  P  in  the  construction  by  the  new  probability  measure 
a»P  +  (1  -  a)»[Lebesgue  measure  on  [0,l]n]. 


This  refinement  can  also  be  easily  worked  out  with  B  being  the  n-dimen- 
sional  cumulant,  because  of  the  fact  (see  p.  12,  line  7)  that  the  (n  -  ^-dimen¬ 
sional  marginals  of  the  given  probability  measure  P  are  simply  (n-l)-dimen- 
sional  Lebesgue  measure.  If  one  isn't  concerned  about  the  particular  formula 
for  the  multilinear  form  B,  then  (for  a  given  g)  "not  so  small"  values  of 
tsl  can  also  be  covered  in  this  refinement  by  simply  using  the  multilinear 
form  B(fj,...,  f^)  *  t»E(f ^ • . . . »f  )  on  a  trivial  probability  space  con¬ 
sisting  of  just  a  single  point.  Also,  Remark  1.1.2  can  be  refined  (with  an 


appropriate  linear  functional)  in  an  exactly  analogous  manner  as  above. 


